OpenCL CUDA NVIDIA GPGPU Linux Tests

Running pts/shoc-1.0.0, pts/askap-1.0.0, pts/cuda-mini-nbody-1.0.0, pts/juliagpu-1.3.0, pts/mandelbulbgpu-1.3.0 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/1812085-KH-1511113PT12&gru&export=txt&rdt&rro.

Test systems

GeForce GTX 950, GTX 980 Ti, GTX 970, GTX 980, GTX 960, GTX TITAN X, GTX 780 Ti, GTX 680, GTX 750 and GTX 760 (same host, only the graphics card changes):

  Processor:          Intel Core i5-6600K @ 3.50GHz (4 Cores)
  Motherboard:        MSI Z170A GAMING PRO (MS-7984) v1.0
  Chipset:            Intel Device 191f
  Memory:             16384MB
  Disk:               256GB TS256GSSD370S
  Audio:              Intel Device a170
  Network:            Intel Device 15b8
  OS:                 Ubuntu 14.04
  Kernel:             3.19.0-33-generic (x86_64)
  Desktop:            Unity 7.2.5
  Display Server:     X Server 1.17.1
  Display Driver:     NVIDIA 352.39
  OpenGL:             4.3.0
  Compiler:           GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5
  File-System:        ext4
  Screen Resolution:  3840x2160

  Graphics per run:
    GeForce GTX 950:      eVGA NVIDIA GeForce GTX 950 2048MB (135/405MHz)
    GeForce GTX 980 Ti:   NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)
    GeForce GTX 970:      eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)
    GeForce GTX 980:      NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)
    GeForce GTX 960:      eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)
    GeForce GTX TITAN X:  NVIDIA GeForce GTX TITAN X 12288MB (1001/3505MHz)
    GeForce GTX 780 Ti:   NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)
    GeForce GTX 680:      NVIDIA GeForce GTX 680 2048MB (1006/3004MHz)
    GeForce GTX 750:      eVGA NVIDIA GeForce GTX 750 1024MB (1019/2505MHz)
    GeForce GTX 760:      NVIDIA GeForce GTX 760 2048MB (980/3004MHz)

GeForce GTX Titan Xp Oct-Off and GeForce GTX Titan Xp Oct-Swappiness 100 (same host; Display Server and File-System are not listed separately for these runs in the export):

  Processor:          Intel Core i9-7920X @ 4.40GHz (24 Cores)
  Motherboard:        ASUS WS X299 SAGE
  Chipset:            Intel Sky Lake-E DMI3 Registers
  Memory:             64512MB
  Disk:               10001GB Western Digital WD101KRYZ-01
  Audio:              Realtek ALC1220
  Network:            Intel Connection
  OS:                 Ubuntu 18.04
  Kernel:             4.15.0-42-generic (x86_64)
  Desktop:            GNOME Shell 3.28.3
  Display Driver:     NVIDIA 410.48
  OpenGL:             4.6.0
  Compiler:           CUDA 9.1
  Screen Resolution:  1920x1080

  Graphics per run:
    GeForce GTX Titan Xp Oct-Off:             TITAN Xp 12288MB (139/405MHz)
    GeForce GTX Titan Xp Oct-Swappiness 100:  TITAN Xp 12288MB (1468/5702MHz)

Compiler Details

- GeForce GTX 950, GTX 980 Ti, GTX 970, GTX 980, GTX 960, GTX TITAN X, GTX 780 Ti, GTX 680, GTX 750, GTX 760 (identical for all ten runs): --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v
- GeForce GTX Titan Xp Oct-Off, GeForce GTX Titan Xp Oct-Swappiness 100 (identical for both runs): --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v

Processor Details

- GeForce GTX 950, GTX 980 Ti, GTX 970, GTX 980, GTX 960, GTX TITAN X, GTX 780 Ti, GTX 680, GTX 750, GTX 760: Scaling Governor: acpi-cpufreq performance
- GeForce GTX Titan Xp Oct-Off, GeForce GTX Titan Xp Oct-Swappiness 100: Scaling Governor: intel_pstate powersave

OpenCL Details (GPU Compute Cores)

- GeForce GTX 950: 768
- GeForce GTX 980 Ti: 2816
- GeForce GTX 970: 1664
- GeForce GTX 980: 2048
- GeForce GTX 960: 1024
- GeForce GTX TITAN X: 3072
- GeForce GTX 780 Ti: 2880
- GeForce GTX 680: 1536
- GeForce GTX 750: 512
- GeForce GTX 760: 1152
- GeForce GTX Titan Xp Oct-Off: 3840
- GeForce GTX Titan Xp Oct-Swappiness 100: 3840

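Results overview (a dash means the test has no result for that card)

SHOC (Texture Read Bandwidth in GB/s, FFT SP in GFLOPS, MD5 Hash in GHash/s; more is better)

GPU                                       CUDA TexBW  OpenCL TexBW  CUDA FFT SP  OpenCL FFT SP  CUDA MD5  OpenCL MD5
GeForce GTX 950                           326.23      239.19        172.28       63.22          2.36      2.34
GeForce GTX 980 Ti                        348.92      345.55        311.46       170.36         6.81      6.79
GeForce GTX 970                           325.16      283.36        263.14       117.23         4.79      4.77
GeForce GTX 980                           336.48      332.60        289.63       140.12         5.70      5.68
GeForce GTX 960                           351.31      269.98        212.43       62.78          3.38      3.36
GeForce GTX TITAN X                       356.52      354.09        324.09       173.89         7.42      7.41
GeForce GTX 780 Ti                        -           286.62        -            126.71         -         3.78
GeForce GTX 680                           -           242.16        -            74.97          -         1.91
GeForce GTX 750                           158.42      121.14        113.64       54.69          1.08      1.07
GeForce GTX 760                           -           170.26        -            78.44          -         1.40
GeForce GTX Titan Xp Oct-Off              623.58      626.12        458.71       280.29         16.01     15.81
GeForce GTX Titan Xp Oct-Swappiness 100   635.35      629.83        450.23       279.72         16.01     15.81

ASKAP (Million Grid Points Per Second), JuliaGPU and MandelbulbGPU (Samples/sec); more is better

GPU                                       askap: Gridding  askap: Degridding  juliagpu: GPU  mandelbulbgpu: GPU
GeForce GTX 950                           3399.14          5706.07            64913682.63    37156070.87
GeForce GTX 980 Ti                        8320.50          17380.60           127978049.53   71656708.83
GeForce GTX 970                           5325.12          9509.14            104144917.23   58811317.17
GeForce GTX 980                           6051.27          11094              113830604.27   63616558.77
GeForce GTX 960                           3144.85          5290.32            80042041.73    44953399.47
GeForce GTX TITAN X                       8458.77          17380.60           136037921.43   75614774.13
GeForce GTX 780 Ti                        -                -                  78839770.13    47400001.90
GeForce GTX 680                           -                -                  48074789.03    31636512.97
GeForce GTX 750                           -                -                  36136874.00    20060275.53
GeForce GTX 760                           -                -                  38310650.50    25392138.50
GeForce GTX Titan Xp Oct-Off              13464.82         25818.77           149218672.50   87713632.10
GeForce GTX Titan Xp Oct-Swappiness 100   13546.37         25011.93           149335524.47   86921286.30

LuxMark (Score; more is better)

GPU                                       GPU - Hotel  GPU - Microphone  GPU - Luxball HDR
GeForce GTX 950                           769          2423              5313
GeForce GTX 980 Ti                        1855         6268              13802
GeForce GTX 970                           1346         4458              9737
GeForce GTX 980                           1492         4776              10713
GeForce GTX 960                           897          2460              5474
GeForce GTX TITAN X                       1906         6360              14081
GeForce GTX 780 Ti                        992          4302              9639
GeForce GTX 680                           577          2127              4554
GeForce GTX 750                           -            -                 3491
GeForce GTX 760                           463          1941              4253
GeForce GTX Titan Xp Oct-Off              -            -                 -
GeForce GTX Titan Xp Oct-Swappiness 100   -            -                 -

CUDA Mini-Nbody (Seconds; fewer is better)

GPU                                       Original  Cache Blocking  Loop Unrolling  SOA Data Layout  Flush Denormals To Zero
GeForce GTX 950                           105.30    49.89           47.54           108.50           108.48
GeForce GTX 980 Ti                        34.58     19.77           18.46           40.94            40.85
GeForce GTX 970                           54.32     28.53           26.42           55.87            55.80
GeForce GTX 980                           45.38     25.13           23.88           50.15            49.53
GeForce GTX 960                           82.01     37.08           35.35           79.97            79.84
GeForce GTX TITAN X                       32.37     18.65           17.59           37.43            37.37
GeForce GTX 780 Ti                        61.03     29.99           27.05           54.39            53.26
GeForce GTX 680                           -         -               -               -                -
GeForce GTX 750                           180.66    98.19           89.34           199.95           199.83
GeForce GTX 760                           -         -               -               -                -
GeForce GTX Titan Xp Oct-Off              20.40     11.49           12.02           22.34            21.99
GeForce GTX Titan Xp Oct-Swappiness 100   20.22     11.37           11.90           22.27            22.22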

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GB/s, More Is Better)

GPU                                       GB/s     SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   635.35   2.35    3
GeForce GTX Titan Xp Oct-Off              623.58   8.97    5
GeForce GTX 750                           158.42   0.42    3
GeForce GTX TITAN X                       356.52   0.12    3
GeForce GTX 960                           351.31   0.14    3
GeForce GTX 980                           336.48   1.15    3
GeForce GTX 970                           325.16   0.28    3
GeForce GTX 980 Ti                        348.92   1.22    3
GeForce GTX 950                           326.23   0.85    3

Two of the results are additionally annotated with -std=c++14.
1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
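The "SE +/-" and "N" figures reported with each result can be put in proportion to the mean. The following is a minimal sketch, assuming that "SE" is the standard error of the mean (sample standard deviation divided by the square root of N); the export itself does not define the term. The function names are illustrative only, and the three sample rows are copied from the CUDA Texture Read Bandwidth results above.

```python
import math


def relative_standard_error(mean, se):
    """Return the standard error as a percentage of the mean."""
    return 100.0 * se / mean


def implied_std_dev(se, n):
    """Recover the sample standard deviation, ASSUMING SE = s / sqrt(N)."""
    return se * math.sqrt(n)


# (mean in GB/s, SE, N) copied from the CUDA Texture Read Bandwidth results.
runs = {
    "GeForce GTX Titan Xp Oct-Swappiness 100": (635.35, 2.35, 3),
    "GeForce GTX Titan Xp Oct-Off": (623.58, 8.97, 5),
    "GeForce GTX 950": (326.23, 0.85, 3),
}

for name, (mean, se, n) in runs.items():
    rse = relative_standard_error(mean, se)
    sd = implied_std_dev(se, n)
    print(f"{name}: relative SE {rse:.2f}%, implied std dev {sd:.2f} GB/s")
```

Under that assumption, the Oct-Off run (N = 5) shows a noticeably larger run-to-run spread than the other two rows, which is consistent with it having been run more times than the others.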

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GB/s, More Is Better)

GPU                                       GB/s     SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   629.83   0.78    3
GeForce GTX Titan Xp Oct-Off              626.12   1.27    3
GeForce GTX 760                           170.26   0.28    3
GeForce GTX 750                           121.14   0.23    3
GeForce GTX 680                           242.16   1.02    3
GeForce GTX 780 Ti                        286.62   0.02    3
GeForce GTX TITAN X                       354.09   1.56    3
GeForce GTX 960                           269.98   0.56    3
GeForce GTX 980                           332.60   0.20    3
GeForce GTX 970                           283.36   0.06    3
GeForce GTX 980 Ti                        345.55   0.21    3
GeForce GTX 950                           239.19   0.73    3

Two of the results are additionally annotated with -std=c++14.
1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GFLOPS, More Is Better)

GPU                                       GFLOPS   SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   450.23   3.82    3
GeForce GTX Titan Xp Oct-Off              458.71   1.03    3
GeForce GTX 750                           113.64   0.69    3
GeForce GTX TITAN X                       324.09   1.19    3
GeForce GTX 960                           212.43   1.49    3
GeForce GTX 980                           289.63   3.09    3
GeForce GTX 970                           263.14   2.44    3
GeForce GTX 980 Ti                        311.46   0.32    3
GeForce GTX 950                           172.28   0.47    3

Two of the results are additionally annotated with -std=c++14.
1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GFLOPS, More Is Better)

GPU                                       GFLOPS   SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   279.72   1.72    3
GeForce GTX Titan Xp Oct-Off              280.29   1.77    3
GeForce GTX 760                           78.44    0.31    3
GeForce GTX 750                           54.69    0.08    3
GeForce GTX 680                           74.97    0.87    3
GeForce GTX 780 Ti                        126.71   0.19    3
GeForce GTX TITAN X                       173.89   0.19    3
GeForce GTX 960                           62.78    1.20    3
GeForce GTX 980                           140.12   1.30    3
GeForce GTX 970                           117.23   0.52    3
GeForce GTX 980 Ti                        170.36   0.65    3
GeForce GTX 950                           63.22    0.08    3

Two of the results are additionally annotated with -std=c++14.
1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
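For the cards that have both a CUDA and an OpenCL FFT SP result, the two targets can be compared directly as a ratio. The sketch below is illustrative only: the dictionary names are made up for this example, and the GFLOPS figures are copied from the two FFT SP result sets in this export. A ratio above 1 means the CUDA target reported higher throughput on that card.

```python
# FFT SP throughput in GFLOPS, copied from the SHOC results in this export.
cuda_fft_sp = {
    "GeForce GTX 950": 172.28,
    "GeForce GTX 960": 212.43,
    "GeForce GTX 970": 263.14,
    "GeForce GTX 980": 289.63,
    "GeForce GTX 980 Ti": 311.46,
    "GeForce GTX TITAN X": 324.09,
    "GeForce GTX 750": 113.64,
    "GeForce GTX Titan Xp Oct-Off": 458.71,
    "GeForce GTX Titan Xp Oct-Swappiness 100": 450.23,
}
opencl_fft_sp = {
    "GeForce GTX 950": 63.22,
    "GeForce GTX 960": 62.78,
    "GeForce GTX 970": 117.23,
    "GeForce GTX 980": 140.12,
    "GeForce GTX 980 Ti": 170.36,
    "GeForce GTX TITAN X": 173.89,
    "GeForce GTX 750": 54.69,
    "GeForce GTX Titan Xp Oct-Off": 280.29,
    "GeForce GTX Titan Xp Oct-Swappiness 100": 279.72,
}

# Ratio of CUDA to OpenCL throughput for every card present in both sets.
cuda_over_opencl = {
    gpu: cuda_fft_sp[gpu] / opencl_fft_sp[gpu]
    for gpu in cuda_fft_sp
    if gpu in opencl_fft_sp
}

for gpu, ratio in sorted(cuda_over_opencl.items(), key=lambda kv: kv[1]):
    print(f"{gpu}: CUDA/OpenCL = {ratio:.2f}x")
```

The GTX 780 Ti, GTX 680 and GTX 760 are left out because the export contains no CUDA FFT SP result for them.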

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GHash/s, More Is Better)

GPU                                       GHash/s  SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   16.01    0.00    3
GeForce GTX Titan Xp Oct-Off              16.01    0.00    3
GeForce GTX 750                           1.08     0.00    3
GeForce GTX TITAN X                       7.42     0.00    3
GeForce GTX 960                           3.38     0.00    3
GeForce GTX 980                           5.70     0.00    3
GeForce GTX 970                           4.79     0.00    3
GeForce GTX 980 Ti                        6.81     0.00    3
GeForce GTX 950                           2.36     0.00    3

Two of the results are additionally annotated with -std=c++14.
1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GHash/s, More Is Better)

GPU                                       GHash/s  SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   15.81    0.01    3
GeForce GTX Titan Xp Oct-Off              15.81    0.00    3
GeForce GTX 760                           1.40     0.00    3
GeForce GTX 750                           1.07     0.00    3
GeForce GTX 680                           1.91     0.00    3
GeForce GTX 780 Ti                        3.78     0.00    3
GeForce GTX TITAN X                       7.41     0.00    3
GeForce GTX 960                           3.36     0.00    3
GeForce GTX 980                           5.68     0.00    3
GeForce GTX 970                           4.77     0.00    3
GeForce GTX 980 Ti                        6.79     0.00    3
GeForce GTX 950                           2.34     0.00    3

Two of the results are additionally annotated with -std=c++14.
1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

ASKAP tConvolveCuda

Processing: Gridding

ASKAP tConvolveCuda 2015-11-10 (Million Grid Points Per Second, More Is Better)

GPU                                       Mpts/s     SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   13546.37   233.57  3
GeForce GTX Titan Xp Oct-Off              13464.82   333.87  6
GeForce GTX TITAN X                       8458.77    130.14  4
GeForce GTX 960                           3144.85    12.43   3
GeForce GTX 980                           6051.27    0.00    3
GeForce GTX 970                           5325.12    0.00    3
GeForce GTX 980 Ti                        8320.50    0.00    3
GeForce GTX 950                           3399.14    14.40   3

1. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

ASKAP tConvolveCuda

Processing: Degridding

ASKAP tConvolveCuda 2015-11-10 (Million Grid Points Per Second, More Is Better)

GPU                                       Mpts/s     SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   25011.93   806.83  3
GeForce GTX Titan Xp Oct-Off              25818.77   806.83  3
GeForce GTX TITAN X                       17380.60   369.80  3
GeForce GTX 960                           5290.32    34.80   3
GeForce GTX 980                           11094.00   0.00    3
GeForce GTX 970                           9509.14    0.00    3
GeForce GTX 980 Ti                        17380.60   369.80  3
GeForce GTX 950                           5706.07    41.05   3

1. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

JuliaGPU

OpenCL Device: GPU

JuliaGPU 1.2pts1 (Samples/sec, More Is Better)

GPU                                       Samples/sec    SE +/-     N
GeForce GTX Titan Xp Oct-Swappiness 100   149335524.47   527550.54  3
GeForce GTX Titan Xp Oct-Off              149218672.50   386285.03  3
GeForce GTX 760                           38310650.50    14125.16   3
GeForce GTX 750                           36136874.00    22546.70   3
GeForce GTX 680                           48074789.03    59682.63   3
GeForce GTX 780 Ti                        78839770.13    293396.06  3
GeForce GTX TITAN X                       136037921.43   318277.32  3
GeForce GTX 960                           80042041.73    157475.07  3
GeForce GTX 980                           113830604.27   218639.12  3
GeForce GTX 970                           104144917.23   84325.23   3
GeForce GTX 980 Ti                        127978049.53   473156.02  3
GeForce GTX 950                           64913682.63    58084.93   3

1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

MandelbulbGPU

OpenCL Device: GPU

MandelbulbGPU 1.0pts1 (Samples/sec, More Is Better)

GPU                                       Samples/sec   SE +/-     N
GeForce GTX Titan Xp Oct-Swappiness 100   86921286.30   950597.06  3
GeForce GTX Titan Xp Oct-Off              87713632.10   188108.10  3
GeForce GTX 760                           25392138.50   28089.31   3
GeForce GTX 750                           20060275.53   9818.73    3
GeForce GTX 680                           31636512.97   36731.70   3
GeForce GTX 780 Ti                        47400001.90   48150.35   3
GeForce GTX TITAN X                       75614774.13   166919.37  3
GeForce GTX 960                           44953399.47   75512.83   3
GeForce GTX 980                           63616558.77   140370.89  3
GeForce GTX 970                           58811317.17   91420.68   3
GeForce GTX 980 Ti                        71656708.83   168304.91  3
GeForce GTX 950                           37156070.87   29855.85   3

1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

OpenCL Device: GPU - Scene: Hotel

LuxMark 3.0 (Score, More Is Better)

GPU                   Score  SE +/-  N
GeForce GTX 760       463    0.33    3
GeForce GTX 680       577    2.00    3
GeForce GTX 780 Ti    992    0.00    3
GeForce GTX TITAN X   1906   0.33    3
GeForce GTX 960       897    0.67    3
GeForce GTX 980       1492   1.20    3
GeForce GTX 970       1346   0.00    3
GeForce GTX 980 Ti    1855   0.33    3
GeForce GTX 950       769    0.00    3

LuxMark

OpenCL Device: GPU - Scene: Microphone

LuxMark 3.0 (Score, More Is Better)

GPU                   Score  SE +/-  N
GeForce GTX 760       1941   0.67    3
GeForce GTX 680       2127   3.06    3
GeForce GTX 780 Ti    4302   12.00   3
GeForce GTX TITAN X   6360   3.00    3
GeForce GTX 960       2460   1.15    3
GeForce GTX 980       4776   0.67    3
GeForce GTX 970       4458   7.64    3
GeForce GTX 980 Ti    6268   18.50   3
GeForce GTX 950       2423   4.26    3

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.0 (Score, More Is Better)

GPU                   Score  SE +/-  N
GeForce GTX 760       4253   1.45    3
GeForce GTX 750       3491   11.67   3
GeForce GTX 680       4554   12.17   3
GeForce GTX 780 Ti    9639   35.97   3
GeForce GTX TITAN X   14081  4.70    3
GeForce GTX 960       5474   0.88    3
GeForce GTX 980       10713  1.20    3
GeForce GTX 970       9737   24.85   3
GeForce GTX 980 Ti    13802  44.35   3
GeForce GTX 950       5313   16.67   3

CUDA Mini-Nbody

Test: Original

CUDA Mini-Nbody 2015-11-10 (Seconds, Fewer Is Better)

GPU                                       Seconds  SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   20.22    0.11    3
GeForce GTX Titan Xp Oct-Off              20.40    0.12    3
GeForce GTX 750                           180.66   0.05    3
GeForce GTX 780 Ti                        61.03    0.50    3
GeForce GTX TITAN X                       32.37    0.35    3
GeForce GTX 960                           82.01    0.43    3
GeForce GTX 980                           45.38    0.10    3
GeForce GTX 970                           54.32    0.13    3
GeForce GTX 980 Ti                        34.58    0.57    3
GeForce GTX 950                           105.30   0.21    3

CUDA Mini-Nbody

Test: Cache Blocking

CUDA Mini-Nbody 2015-11-10 (Seconds, Fewer Is Better)

GPU                                       Seconds  SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   11.37    0.02    3
GeForce GTX Titan Xp Oct-Off              11.49    0.03    3
GeForce GTX 750                           98.19    0.00    3
GeForce GTX 780 Ti                        29.99    0.27    3
GeForce GTX TITAN X                       18.65    0.10    3
GeForce GTX 960                           37.08    0.01    3
GeForce GTX 980                           25.13    0.06    3
GeForce GTX 970                           28.53    0.01    3
GeForce GTX 980 Ti                        19.77    0.21    3
GeForce GTX 950                           49.89    0.02    3
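Because the Mini-Nbody results are run times (fewer is better), a speedup is obtained by dividing the slower time by the faster one. The sketch below compares the Original and Cache Blocking variants per card; the variable names are illustrative, and the timings are copied from the two Mini-Nbody result sets above.

```python
# Run times in seconds, copied from the CUDA Mini-Nbody results in this export.
original_s = {
    "GeForce GTX Titan Xp Oct-Swappiness 100": 20.22,
    "GeForce GTX Titan Xp Oct-Off": 20.40,
    "GeForce GTX 750": 180.66,
    "GeForce GTX 780 Ti": 61.03,
    "GeForce GTX TITAN X": 32.37,
    "GeForce GTX 960": 82.01,
    "GeForce GTX 980": 45.38,
    "GeForce GTX 970": 54.32,
    "GeForce GTX 980 Ti": 34.58,
    "GeForce GTX 950": 105.30,
}
cache_blocking_s = {
    "GeForce GTX Titan Xp Oct-Swappiness 100": 11.37,
    "GeForce GTX Titan Xp Oct-Off": 11.49,
    "GeForce GTX 750": 98.19,
    "GeForce GTX 780 Ti": 29.99,
    "GeForce GTX TITAN X": 18.65,
    "GeForce GTX 960": 37.08,
    "GeForce GTX 980": 25.13,
    "GeForce GTX 970": 28.53,
    "GeForce GTX 980 Ti": 19.77,
    "GeForce GTX 950": 49.89,
}

# For "fewer is better" timings, speedup = baseline time / optimized time.
speedup = {gpu: original_s[gpu] / cache_blocking_s[gpu] for gpu in original_s}

for gpu, factor in sorted(speedup.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{gpu}: cache blocking is {factor:.2f}x faster than the original")
```

On every card listed, the Cache Blocking variant finishes in less time than the Original variant, so each computed factor is greater than 1.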

CUDA Mini-Nbody

Test: Loop Unrolling

CUDA Mini-Nbody 2015-11-10 (Seconds, Fewer Is Better)

GPU                                       Seconds  SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   11.90    0.03    3
GeForce GTX Titan Xp Oct-Off              12.02    0.11    3
GeForce GTX 750                           89.34    0.04    3
GeForce GTX 780 Ti                        27.05    0.05    3
GeForce GTX TITAN X                       17.59    0.25    3
GeForce GTX 960                           35.35    0.03    3
GeForce GTX 980                           23.88    0.21    3
GeForce GTX 970                           26.42    0.02    3
GeForce GTX 980 Ti                        18.46    0.15    3
GeForce GTX 950                           47.54    0.03    3

CUDA Mini-Nbody

Test: SOA Data Layout

CUDA Mini-Nbody 2015-11-10 (Seconds, Fewer Is Better)

GPU                                       Seconds  SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   22.27    0.05    3
GeForce GTX Titan Xp Oct-Off              22.34    0.09    3
GeForce GTX 750                           199.95   0.04    3
GeForce GTX 780 Ti                        54.39    0.16    3
GeForce GTX TITAN X                       37.43    0.20    3
GeForce GTX 960                           79.97    0.08    3
GeForce GTX 980                           50.15    0.21    3
GeForce GTX 970                           55.87    0.05    3
GeForce GTX 980 Ti                        40.94    0.11    3
GeForce GTX 950                           108.50   0.02    3

CUDA Mini-Nbody

Test: Flush Denormals To Zero

CUDA Mini-Nbody 2015-11-10 (Seconds, Fewer Is Better)

GPU                                       Seconds  SE +/-  N
GeForce GTX Titan Xp Oct-Swappiness 100   22.22    0.03    3
GeForce GTX Titan Xp Oct-Off              21.99    0.11    3
GeForce GTX 750                           199.83   0.03    3
GeForce GTX 780 Ti                        53.26    0.10    3
GeForce GTX TITAN X                       37.37    0.08    3
GeForce GTX 960                           79.84    0.01    3
GeForce GTX 980                           49.53    0.18    3
GeForce GTX 970                           55.80    0.07    3
GeForce GTX 980 Ti                        40.85    0.10    3
GeForce GTX 950                           108.48   0.02    3


Phoronix Test Suite v10.8.5