Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1710029-TY-1511113PT61

OpenCL CUDA NVIDIA GPGPU Linux Tests
TR1950X 4way1080ti
HTML result view exported from: https://openbenchmarking.org/result/1710029-TY-1511113PT61&export=txt&grr&rdt&rro .
OpenCL CUDA NVIDIA GPGPU Linux Tests

System configuration: GeForce GTX 950, GeForce GTX 980 Ti, GeForce GTX 970, GeForce GTX 980, GeForce GTX 960, GeForce GTX TITAN X, GeForce GTX 780 Ti, GeForce GTX 680, GeForce GTX 750, GeForce GTX 760
(only the Graphics entry changes between these runs)

  Processor:          Intel Core i5-6600K @ 3.50GHz (4 Cores)
  Motherboard:        MSI Z170A GAMING PRO (MS-7984) v1.0
  Chipset:            Intel Device 191f
  Memory:             16384MB
  Disk:               256GB TS256GSSD370S
  Audio:              Intel Device a170
  Network:            Intel Device 15b8
  OS:                 Ubuntu 14.04
  Kernel:             3.19.0-33-generic (x86_64)
  Desktop:            Unity 7.2.5
  Display Server:     X Server 1.17.1
  Display Driver:     NVIDIA 352.39
  OpenGL:             4.3.0
  Compiler:           GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5
  File-System:        ext4
  Screen Resolution:  3840x2160

  Graphics:
    GeForce GTX 950       eVGA NVIDIA GeForce GTX 950 2048MB (135/405MHz)
    GeForce GTX 980 Ti    NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)
    GeForce GTX 970       eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)
    GeForce GTX 980       NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)
    GeForce GTX 960       eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)
    GeForce GTX TITAN X   NVIDIA GeForce GTX TITAN X 12288MB (1001/3505MHz)
    GeForce GTX 780 Ti    NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)
    GeForce GTX 680       NVIDIA GeForce GTX 680 2048MB (1006/3004MHz)
    GeForce GTX 750       eVGA NVIDIA GeForce GTX 750 1024MB (1019/2505MHz)
    GeForce GTX 760       NVIDIA GeForce GTX 760 2048MB (980/3004MHz)

System configuration: TR1950X 4way1080ti

  Processor:          AMD Ryzen Threadripper 1950X 16-Core @ 3.40GHz (32 Cores)
  Motherboard:        ASRock X399 Professional Gaming
  Chipset:            AMD Device 1450
  Memory:             8 x 16384 MB DDR4-1467MHz
  Disk:               120GB HP SSD S700 120G + Samsung SSD 960 EVO 250GB + 1000GB Samsung SSD 960 EVO 1TB
  Graphics:           EFI VGA
  Audio:              NVIDIA Device 10ef
  Network:            Device 1d6a:d107 + Intel Device 24fb
  OS:                 Ubuntu 16.04
  Kernel:             4.13.4-041304-generic (x86_64)
  Display Driver:     modesetting 1.19.3
  Compiler:           GCC 5.4.0 20160609 + CUDA 8.0
  File-System:        zfs
  Screen Resolution:  2560x1440

System configuration: TR1950X 4way1080tigtx, TR1950X quad1080ti, TR1950X quad1080tigtx
(entries that differ from TR1950X 4way1080ti)

  Graphics:           MSI NVIDIA GeForce GTX 1080 Ti 11264MB (1544/5508MHz)
  Monitor:            Dell S2716DG
  Desktop:            Unity 7.4.0
  Display Server:     X Server 1.19.3
  Display Driver:     NVIDIA 384.90
  OpenGL:             4.5.0

Compiler Details

- GeForce GTX 950, GeForce GTX 980 Ti, GeForce GTX 970, GeForce GTX 980, GeForce GTX 960, GeForce GTX TITAN X, GeForce GTX 780 Ti, GeForce GTX 680, GeForce GTX 750, GeForce GTX 760: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v
- TR1950X 4way1080ti, TR1950X 4way1080tigtx, TR1950X quad1080ti, TR1950X quad1080tigtx: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details

- GeForce GTX 950, GeForce GTX 980 Ti, GeForce GTX 970, GeForce GTX 980, GeForce GTX 960, GeForce GTX TITAN X, GeForce GTX 780 Ti, GeForce GTX 680, GeForce GTX 750, GeForce GTX 760: Scaling Governor: acpi-cpufreq performance
- TR1950X 4way1080ti, TR1950X 4way1080tigtx, TR1950X quad1080ti, TR1950X quad1080tigtx: Scaling Governor: acpi-cpufreq ondemand

OpenCL Details / System Details

  System                  GPU Compute Cores
  GeForce GTX 950         768
  GeForce GTX 980 Ti      2816
  GeForce GTX 970         1664
  GeForce GTX 980         2048
  GeForce GTX 960         1024
  GeForce GTX TITAN X     3072
  GeForce GTX 780 Ti      2880
  GeForce GTX 680         1536
  GeForce GTX 750         512
  GeForce GTX 760         1152
  TR1950X 4way1080tigtx   3584
  TR1950X quad1080ti      3584
  TR1950X quad1080tigtx   3584
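OpenCL CUDA NVIDIA GPGPU Linux Tests: result overview

A dash (-) marks a test with no result for that system. TR1950X 4way1080ti has no results.

GeForce GTX cards, part 1:

Test                                      GTX 950       GTX 980 Ti    GTX 970       GTX 980       GTX 960       GTX TITAN X
luxmark: GPU - Luxball HDR                5313          13802         9737          10713         5474          14081
luxmark: GPU - Microphone                 2423          6268          4458          4776          2460          6360
luxmark: GPU - Hotel                      769           1855          1346          1492          897           1906
mandelbulbgpu: GPU                        37156070.87   71656708.83   58811317.17   63616558.77   44953399.47   75614774.13
juliagpu: GPU                             64913682.63   127978049.53  104144917.23  113830604.27  80042041.73   136037921.43
cuda-mini-nbody: Flush Denormals To Zero  108.48        40.85         55.80         49.53         79.84         37.37
cuda-mini-nbody: SOA Data Layout          108.50        40.94         55.87         50.15         79.97         37.43
cuda-mini-nbody: Loop Unrolling           47.54         18.46         26.42         23.88         35.35         17.59
cuda-mini-nbody: Cache Blocking           49.89         19.77         28.53         25.13         37.08         18.65
cuda-mini-nbody: Original                 105.30        34.58         54.32         45.38         82.01         32.37
askap: Degridding                         5706.07       17380.60      9509.14       11094         5290.32       17380.60
askap: Gridding                           3399.14       8320.50       5325.12       6051.27       3144.85       8458.77
shoc: OpenCL - Texture Read Bandwidth     239.19        345.55        283.36        332.60        269.98        354.09
shoc: CUDA - Texture Read Bandwidth       326.23        348.92        325.16        336.48        351.31        356.52
shoc: OpenCL - MD5 Hash                   2.34          6.79          4.77          5.68          3.36          7.41
shoc: OpenCL - FFT SP                     63.22         170.36        117.23        140.12        62.78         173.89
shoc: CUDA - MD5 Hash                     2.36          6.81          4.79          5.70          3.38          7.42
shoc: CUDA - FFT SP                       172.28        311.46        263.14        289.63        212.43        324.09

GeForce GTX cards, part 2:

Test                                      GTX 780 Ti    GTX 680       GTX 750       GTX 760
luxmark: GPU - Luxball HDR                9639          4554          3491          4253
luxmark: GPU - Microphone                 4302          2127          -             1941
luxmark: GPU - Hotel                      992           577           -             463
mandelbulbgpu: GPU                        47400001.90   31636512.97   20060275.53   25392138.50
juliagpu: GPU                             78839770.13   48074789.03   36136874.00   38310650.50
cuda-mini-nbody: Flush Denormals To Zero  53.26         -             199.83        -
cuda-mini-nbody: SOA Data Layout          54.39         -             199.95        -
cuda-mini-nbody: Loop Unrolling           27.05         -             89.34         -
cuda-mini-nbody: Cache Blocking           29.99         -             98.19         -
cuda-mini-nbody: Original                 61.03         -             180.66        -
askap: Degridding                         -             -             -             -
askap: Gridding                           -             -             -             -
shoc: OpenCL - Texture Read Bandwidth     286.62        242.16        121.14        170.26
shoc: CUDA - Texture Read Bandwidth       -             -             158.42        -
shoc: OpenCL - MD5 Hash                   3.78          1.91          1.07          1.40
shoc: OpenCL - FFT SP                     126.71        74.97         54.69         78.44
shoc: CUDA - MD5 Hash                     -             -             1.08          -
shoc: CUDA - FFT SP                       -             -             113.64        -

TR1950X systems:

Test                                      TR1950X 4way1080tigtx   TR1950X quad1080ti      TR1950X quad1080tigtx
luxmark: GPU - Luxball HDR                81533                   -                       81567
luxmark: GPU - Microphone                 41545                   -                       41075
luxmark: GPU - Hotel                      15393                   -                       15252
mandelbulbgpu: GPU                        105735837.57            -                       107641345.20
juliagpu: GPU                             159492716.80            -                       159005850.77
cuda-mini-nbody: Flush Denormals To Zero  22.71                   -                       21.93
cuda-mini-nbody: SOA Data Layout          22.77                   -                       22.03
cuda-mini-nbody: Loop Unrolling           12.90                   -                       12.03
cuda-mini-nbody: Cache Blocking           12.18                   -                       11.37
cuda-mini-nbody: Original                 22.49                   -                       21.77
askap: Degridding                         21619.07                -                       22188
askap: Gridding                           11094                   -                       10946.07
shoc: OpenCL - Texture Read Bandwidth     627.38                  -                       624.43
shoc: CUDA - Texture Read Bandwidth       632.61                  -                       629.47
shoc: OpenCL - MD5 Hash                   15.94                   -                       15.88
shoc: OpenCL - FFT SP                     293.37                  -                       290.19
shoc: CUDA - MD5 Hash                     15.97                   -                       15.93
shoc: CUDA - FFT SP                       464.39                  460.40                  465.39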
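When comparing two systems from the overview, the direction of each metric matters: LuxMark and SHOC results are "More Is Better", while the CUDA Mini-Nbody times are "Fewer Is Better". The following is a minimal Python sketch of that comparison, not part of the Phoronix Test Suite itself. The figures are copied from the results in this file; the `RESULTS` layout and the `speedup` helper are hypothetical names introduced only for illustration.

```python
# Illustrative sketch: relative performance between two systems.
# Values are copied from this result file; the data layout and the
# speedup() helper are assumptions made for this example.
RESULTS = {
    # test name: (higher_is_better, {system: result})
    "luxmark: GPU - Luxball HDR": (
        True,
        {"GeForce GTX TITAN X": 14081, "TR1950X quad1080tigtx": 81567},
    ),
    "cuda-mini-nbody: Original": (
        False,
        {"GeForce GTX TITAN X": 32.37, "TR1950X quad1080tigtx": 21.77},
    ),
    "shoc: CUDA - FFT SP": (
        True,
        {"GeForce GTX TITAN X": 324.09, "TR1950X quad1080tigtx": 465.39},
    ),
}


def speedup(test, baseline, candidate):
    """Return how many times faster `candidate` is than `baseline`.

    For "More Is Better" metrics this is candidate / baseline; for
    "Fewer Is Better" metrics (run times) the ratio is inverted.
    """
    higher_is_better, scores = RESULTS[test]
    base, cand = scores[baseline], scores[candidate]
    return cand / base if higher_is_better else base / cand


if __name__ == "__main__":
    for name in RESULTS:
        ratio = speedup(name, "GeForce GTX TITAN X", "TR1950X quad1080tigtx")
        print(f"{name}: {ratio:.2f}x")
```

Inverting the ratio for time-based tests keeps every figure on the same "larger means faster" scale, which makes results from different tests easier to read side by side.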

LuxMark 3.0
OpenCL Device: GPU - Scene: Luxball HDR
Score, More Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   81567           100.65        3
TR1950X 4way1080tigtx   81533           211.33        3
GeForce GTX 760         4253            1.45          3
GeForce GTX 750         3491            11.67         3
GeForce GTX 680         4554            12.17         3
GeForce GTX 780 Ti      9639            35.97         3
GeForce GTX TITAN X     14081           4.70          3
GeForce GTX 960         5474            0.88          3
GeForce GTX 980         10713           1.20          3
GeForce GTX 970         9737            24.85         3
GeForce GTX 980 Ti      13802           44.35         3
GeForce GTX 950         5313            16.67         3

LuxMark 3.0
OpenCL Device: GPU - Scene: Microphone
Score, More Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   41075           12.77         3
TR1950X 4way1080tigtx   41545           407.81        3
GeForce GTX 760         1941            0.67          3
GeForce GTX 680         2127            3.06          3
GeForce GTX 780 Ti      4302            12.00         3
GeForce GTX TITAN X     6360            3.00          3
GeForce GTX 960         2460            1.15          3
GeForce GTX 980         4776            0.67          3
GeForce GTX 970         4458            7.64          3
GeForce GTX 980 Ti      6268            18.50         3
GeForce GTX 950         2423            4.26          3

LuxMark 3.0
OpenCL Device: GPU - Scene: Hotel
Score, More Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   15252           11.53         3
TR1950X 4way1080tigtx   15393           143.33        3
GeForce GTX 760         463             0.33          3
GeForce GTX 680         577             2.00          3
GeForce GTX 780 Ti      992             0.00          3
GeForce GTX TITAN X     1906            0.33          3
GeForce GTX 960         897             0.67          3
GeForce GTX 980         1492            1.20          3
GeForce GTX 970         1346            0.00          3
GeForce GTX 980 Ti      1855            0.33          3
GeForce GTX 950         769             0.00          3

MandelbulbGPU 1.0pts1
OpenCL Device: GPU
Samples/sec, More Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   107641345.20    948840.43     3
TR1950X 4way1080tigtx   105735837.57    571468.80     3
GeForce GTX 760         25392138.50     28089.31      3
GeForce GTX 750         20060275.53     9818.73       3
GeForce GTX 680         31636512.97     36731.70      3
GeForce GTX 780 Ti      47400001.90     48150.35      3
GeForce GTX TITAN X     75614774.13     166919.37     3
GeForce GTX 960         44953399.47     75512.83      3
GeForce GTX 980         63616558.77     140370.89     3
GeForce GTX 970         58811317.17     91420.68      3
GeForce GTX 980 Ti      71656708.83     168304.91     3
GeForce GTX 950         37156070.87     29855.85      3

1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

JuliaGPU 1.2pts1
OpenCL Device: GPU
Samples/sec, More Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   159005850.77    183274.27     3
TR1950X 4way1080tigtx   159492716.80    1188059.83    3
GeForce GTX 760         38310650.50     14125.16      3
GeForce GTX 750         36136874.00     22546.70      3
GeForce GTX 680         48074789.03     59682.63      3
GeForce GTX 780 Ti      78839770.13     293396.06     3
GeForce GTX TITAN X     136037921.43    318277.32     3
GeForce GTX 960         80042041.73     157475.07     3
GeForce GTX 980         113830604.27    218639.12     3
GeForce GTX 970         104144917.23    84325.23      3
GeForce GTX 980 Ti      127978049.53    473156.02     3
GeForce GTX 950         64913682.63     58084.93      3

1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

CUDA Mini-Nbody 2015-11-10
Test: Flush Denormals To Zero
Seconds, Fewer Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   21.93           0.00          3
TR1950X 4way1080tigtx   22.71           0.03          3
GeForce GTX 750         199.83          0.03          3
GeForce GTX 780 Ti      53.26           0.10          3
GeForce GTX TITAN X     37.37           0.08          3
GeForce GTX 960         79.84           0.01          3
GeForce GTX 980         49.53           0.18          3
GeForce GTX 970         55.80           0.07          3
GeForce GTX 980 Ti      40.85           0.10          3
GeForce GTX 950         108.48          0.02          3

CUDA Mini-Nbody 2015-11-10
Test: SOA Data Layout
Seconds, Fewer Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   22.03           0.03          3
TR1950X 4way1080tigtx   22.77           0.03          3
GeForce GTX 750         199.95          0.04          3
GeForce GTX 780 Ti      54.39           0.16          3
GeForce GTX TITAN X     37.43           0.20          3
GeForce GTX 960         79.97           0.08          3
GeForce GTX 980         50.15           0.21          3
GeForce GTX 970         55.87           0.05          3
GeForce GTX 980 Ti      40.94           0.11          3
GeForce GTX 950         108.50          0.02          3

CUDA Mini-Nbody 2015-11-10
Test: Loop Unrolling
Seconds, Fewer Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   12.03           0.03          3
TR1950X 4way1080tigtx   12.90           0.01          3
GeForce GTX 750         89.34           0.04          3
GeForce GTX 780 Ti      27.05           0.05          3
GeForce GTX TITAN X     17.59           0.25          3
GeForce GTX 960         35.35           0.03          3
GeForce GTX 980         23.88           0.21          3
GeForce GTX 970         26.42           0.02          3
GeForce GTX 980 Ti      18.46           0.15          3
GeForce GTX 950         47.54           0.03          3

CUDA Mini-Nbody 2015-11-10
Test: Cache Blocking
Seconds, Fewer Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   11.37           0.02          3
TR1950X 4way1080tigtx   12.18           0.04          3
GeForce GTX 750         98.19           0.00          3
GeForce GTX 780 Ti      29.99           0.27          3
GeForce GTX TITAN X     18.65           0.10          3
GeForce GTX 960         37.08           0.01          3
GeForce GTX 980         25.13           0.06          3
GeForce GTX 970         28.53           0.01          3
GeForce GTX 980 Ti      19.77           0.21          3
GeForce GTX 950         49.89           0.02          3

CUDA Mini-Nbody 2015-11-10
Test: Original
Seconds, Fewer Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   21.77           0.10          3
TR1950X 4way1080tigtx   22.49           0.02          3
GeForce GTX 750         180.66          0.05          3
GeForce GTX 780 Ti      61.03           0.50          3
GeForce GTX TITAN X     32.37           0.35          3
GeForce GTX 960         82.01           0.43          3
GeForce GTX 980         45.38           0.10          3
GeForce GTX 970         54.32           0.13          3
GeForce GTX 980 Ti      34.58           0.57          3
GeForce GTX 950         105.30          0.21          3

ASKAP tConvolveCuda 2015-11-10
Processing: Degridding
Million Grid Points Per Second, More Is Better (OpenBenchmarking.org)

System                  Result
TR1950X quad1080tigtx   22188.00
TR1950X 4way1080tigtx   21619.07
GeForce GTX TITAN X     17380.60
GeForce GTX 960         5290.32
GeForce GTX 980         11094.00
GeForce GTX 970         9509.14
GeForce GTX 980 Ti      17380.60
GeForce GTX 950         5706.07

SE +/- values as exported, in chart order (seven values for eight systems): 568.93 (N = 3), 369.80 (N = 3), 34.80 (N = 3), 0.00 (N = 3), 0.00 (N = 3), 369.80 (N = 3), 41.05 (N = 3)

1. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

ASKAP tConvolveCuda 2015-11-10
Processing: Gridding
Million Grid Points Per Second, More Is Better (OpenBenchmarking.org)

System                  Result
TR1950X quad1080tigtx   10946.07
TR1950X 4way1080tigtx   11094.00
GeForce GTX TITAN X     8458.77
GeForce GTX 960         3144.85
GeForce GTX 980         6051.27
GeForce GTX 970         5325.12
GeForce GTX 980 Ti      8320.50
GeForce GTX 950         3399.14

SE +/- values as exported, in chart order (seven values for eight systems): 147.93 (N = 3), 130.14 (N = 4), 12.43 (N = 3), 0.00 (N = 3), 0.00 (N = 3), 0.00 (N = 3), 14.40 (N = 3)

1. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: Texture Read Bandwidth
GB/s, More Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   624.43          2.25          3
TR1950X 4way1080tigtx   627.38          0.50          3
GeForce GTX 760         170.26          0.28          3
GeForce GTX 750         121.14          0.23          3
GeForce GTX 680         242.16          1.02          3
GeForce GTX 780 Ti      286.62          0.02          3
GeForce GTX TITAN X     354.09          1.56          3
GeForce GTX 960         269.98          0.56          3
GeForce GTX 980         332.60          0.20          3
GeForce GTX 970         283.36          0.06          3
GeForce GTX 980 Ti      345.55          0.21          3
GeForce GTX 950         239.19          0.73          3

1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: CUDA - Benchmark: Texture Read Bandwidth
GB/s, More Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   629.47          0.81          3
TR1950X 4way1080tigtx   632.61          0.60          3
GeForce GTX 750         158.42          0.42          3
GeForce GTX TITAN X     356.52          0.12          3
GeForce GTX 960         351.31          0.14          3
GeForce GTX 980         336.48          1.15          3
GeForce GTX 970         325.16          0.28          3
GeForce GTX 980 Ti      348.92          1.22          3
GeForce GTX 950         326.23          0.85          3

1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: MD5 Hash
GHash/s, More Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   15.88           0.00          3
TR1950X 4way1080tigtx   15.94           0.07          3
GeForce GTX 760         1.40            0.00          3
GeForce GTX 750         1.07            0.00          3
GeForce GTX 680         1.91            0.00          3
GeForce GTX 780 Ti      3.78            0.00          3
GeForce GTX TITAN X     7.41            0.00          3
GeForce GTX 960         3.36            0.00          3
GeForce GTX 980         5.68            0.00          3
GeForce GTX 970         4.77            0.00          3
GeForce GTX 980 Ti      6.79            0.00          3
GeForce GTX 950         2.34            0.00          3

1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: FFT SP
GFLOPS, More Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   290.19          3.16          3
TR1950X 4way1080tigtx   293.37          1.64          3
GeForce GTX 760         78.44           0.31          3
GeForce GTX 750         54.69           0.08          3
GeForce GTX 680         74.97           0.87          3
GeForce GTX 780 Ti      126.71          0.19          3
GeForce GTX TITAN X     173.89          0.19          3
GeForce GTX 960         62.78           1.20          3
GeForce GTX 980         140.12          1.30          3
GeForce GTX 970         117.23          0.52          3
GeForce GTX 980 Ti      170.36          0.65          3
GeForce GTX 950         63.22           0.08          3

1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: CUDA - Benchmark: MD5 Hash
GHash/s, More Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   15.93           0.02          3
TR1950X 4way1080tigtx   15.97           0.01          3
GeForce GTX 750         1.08            0.00          3
GeForce GTX TITAN X     7.42            0.00          3
GeForce GTX 960         3.38            0.00          3
GeForce GTX 980         5.70            0.00          3
GeForce GTX 970         4.79            0.00          3
GeForce GTX 980 Ti      6.81            0.00          3
GeForce GTX 950         2.36            0.00          3

1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: CUDA - Benchmark: FFT SP
GFLOPS, More Is Better (OpenBenchmarking.org)

System                  Result          SE +/-        N
TR1950X quad1080tigtx   465.39          0.97          3
TR1950X quad1080ti      460.40          6.39          6
TR1950X 4way1080tigtx   464.39          4.74          3
GeForce GTX 750         113.64          0.69          3
GeForce GTX TITAN X     324.09          1.19          3
GeForce GTX 960         212.43          1.49          3
GeForce GTX 980         289.63          3.09          3
GeForce GTX 970         263.14          2.44          3
GeForce GTX 980 Ti      311.46          0.32          3
GeForce GTX 950         172.28          0.47          3

1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
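
Each result in this file is reported with "SE +/-" and "N", the standard error of the mean and the number of runs. As a rough sketch of how such a figure can be derived from individual runs, the Python below computes the sample standard deviation divided by the square root of N. This is the textbook definition and is assumed here; the file does not state the exact formula the Phoronix Test Suite uses. The run values are hypothetical, since per-run data is not included in this export.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N).

    Assumption: the sample (N - 1) standard deviation is used; the
    result file does not say which variant the test suite applies.
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(n)


if __name__ == "__main__":
    runs = [100.0, 102.0, 98.0]  # hypothetical run results, not from this file
    print(
        f"mean = {statistics.mean(runs):.2f}, "
        f"SE +/- {standard_error(runs):.2f}, N = {len(runs)}"
    )
```

A small SE relative to the mean indicates that the runs agreed closely, which is worth checking before reading much into a difference of one or two percent between systems.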
Phoronix Test Suite v10.8.4