GPU Compute

AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and NVIDIA GeForce GTX 1060 6GB on Ubuntu 20.04 via the Phoronix Test Suite.

TITAN RTX

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: NVIDIA TITAN RTX 24GB (420/405MHz), Audio: NVIDIA TU102 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

OS: Ubuntu 20.04, Kernel: 5.4.0-45-generic (x86_64), Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Display Driver: NVIDIA 450.66, OpenGL: 4.6.0, OpenCL: OpenCL 1.2 CUDA 11.0.228, Vulkan: 1.2.133, Compiler: GCC 9.3.0 + CUDA 11.0, File-System: ext4, Screen Resolution: 3840x2160

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 4608
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

GTX 1070 Ti

Changed Graphics to Zotac NVIDIA GeForce GTX 1070 Ti 8GB (1607/4006MHz).

OpenCL Change: GPU Compute Cores: 2432

GTX 970

Changed Graphics to eVGA NVIDIA GeForce GTX 970 4GB (1163/3505MHz).

OpenCL Change: GPU Compute Cores: 1664

GTX 1660 SUPER

Changed Graphics to eVGA NVIDIA GeForce GTX 1660 SUPER 6GB (1530/7000MHz).

OpenCL Change: GPU Compute Cores: 1408

GTX 980

Changed Graphics to NVIDIA GeForce GTX 980 4GB (1126/3505MHz).

OpenCL Change: GPU Compute Cores: 2048

GTX 1070

Changed Graphics to NVIDIA GeForce GTX 1070 8GB (1506/4006MHz).

OpenCL Change: GPU Compute Cores: 1920

GTX 1080

Changed Graphics to NVIDIA GeForce GTX 1080 8GB (1607/5005MHz).

OpenCL Change: GPU Compute Cores: 2560

GTX 980 Ti

Changed Graphics to NVIDIA GeForce GTX 980 Ti 6GB (999/3505MHz).

OpenCL Change: GPU Compute Cores: 2816

RTX 2080 Ti

Changed Graphics to NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz).

OpenCL Change: GPU Compute Cores: 4352

GTX 1060

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: NVIDIA GeForce GTX 1060 6GB (1506/4006MHz), Audio: NVIDIA GP106 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

OS: Ubuntu 20.04, Kernel: 5.4.0-47-generic (x86_64), Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Display Driver: NVIDIA 450.66, OpenGL: 4.6.0, OpenCL: OpenCL 1.2 CUDA 11.0.228, Vulkan: 1.2.133, Compiler: GCC 9.3.0 + CUDA 11.0, File-System: ext4, Screen Resolution: 3840x2160

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 1280
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

Result

Nodes Per Second Per Watt

GPU Power Consumption

GPU Temp

GROMACS

The CUDA version of the Gromacs molecular dynamics package. Learn more via the OpenBenchmarking.org test page.

Result

Ns Per Day Per Watt

GPU Power Consumption

GPU Temp

OctaneBench

OctaneBench is a test of the OctaneRender on the GPU and requires the use of NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

Result

Score Per Watt

GPU Power Consumption

GPU Temp

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

Result

Ns Per Day Per Watt

GPU Power Consumption

GPU Temp

LuxCoreRender OpenCL

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on OpenCL accelerators/GPUs. The alternative luxcorerender test profile is for CPU execution due to a difference in tests, etc. Learn more via the OpenBenchmarking.org test page.

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

Result

Test Cases Per Minute Per Watt

GPU Power Consumption

GPU Temp

Result

Test Cases Per Minute Per Watt

GPU Power Consumption

GPU Temp

LuxCoreRender OpenCL

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

LuxCoreRender OpenCL

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

Rodinia

Result

GPU Power Consumption

GPU Temp

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

Darmstadt Automotive Parallel Heterogeneous Suite

Result

Test Cases Per Minute Per Watt

GPU Power Consumption

GPU Temp

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

Darmstadt Automotive Parallel Heterogeneous Suite

Result

Test Cases Per Minute Per Watt

GPU Power Consumption

GPU Temp

ArrayFire

ArrayFire is an GPU and CPU numeric processing library, this test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Darmstadt Automotive Parallel Heterogeneous Suite

Result

Test Cases Per Minute Per Watt

GPU Power Consumption

GPU Temp

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

Darmstadt Automotive Parallel Heterogeneous Suite

Result

Test Cases Per Minute Per Watt

GPU Power Consumption

GPU Temp

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

Result

Samples/sec Per Watt

Power Consumption

Temp

Rodinia

Result

GPU Power Consumption

GPU Temp

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

Result

GB/s Per Watt

GPU Power Consumption

GPU Temp

Result

GB/s Per Watt

GPU Power Consumption

GPU Temp

Result

GB/s Per Watt

GPU Power Consumption

GPU Temp

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

Result

GPU Power Consumption

GPU Temp

Result

GPU Power Consumption

GPU Temp

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

Result

GIOPS Per Watt

GPU Power Consumption

GPU Temp

ArrayFire

ArrayFire is an GPU and CPU numeric processing library, this test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile uses ViennaCL OpenCL support and runs the included computational benchmark. Learn more via the OpenBenchmarking.org test page.

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

Result

GBPS Per Watt

GPU Power Consumption

GPU Temp

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

Result

GIOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Darktable

Result

GPU Power Consumption

GPU Temp

NeatBench

NeatBench is a benchmark of the cross-platform Neat Video software on the CPU and optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.

Result

FPS Per Watt

Power Consumption

Temp

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

Result

GIOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

GPU Temperature Monitor

GPU Power Consumption Monitor

Meta Performance Per Watts

54 Results Shown

LeelaChessZero
GROMACS
OctaneBench
FAHBench
LuxCoreRender OpenCL:
Food
LuxCore Benchmark
Darmstadt Automotive Parallel Heterogeneous Suite:
OpenCL - Points2Image
NVIDIA CUDA - Points2Image
LuxCoreRender OpenCL
Rodinia
PlaidML
LuxCoreRender OpenCL
Rodinia
PlaidML
Darmstadt Automotive Parallel Heterogeneous Suite
PlaidML:
No - Training - Mobilenet - OpenCL
No - Inference - VGG16 - OpenCL
clpeak
PlaidML
Darmstadt Automotive Parallel Heterogeneous Suite
ArrayFire
Darmstadt Automotive Parallel Heterogeneous Suite
NAMD CUDA
PlaidML
Darmstadt Automotive Parallel Heterogeneous Suite
MandelGPU
Rodinia
cl-mem:
Read
Write
Copy
PlaidML:
Yes - Inference - Mobilenet - OpenCL
No - Inference - Mobilenet - OpenCL
FinanceBench
Darktable:
Masskrug - OpenCL
Boat - OpenCL
Server Room - OpenCL
clpeak
ArrayFire
ViennaCL
Mixbench
clpeak
Mixbench:
NVIDIA CUDA - Integer
NVIDIA CUDA - Double Precision
NVIDIA CUDA - Half Precision
clpeak
Mixbench
Darktable
NeatBench
Mixbench:
OpenCL - Integer
OpenCL - Double Precision
FinanceBench
GPU Temperature Monitor:
Phoronix Test Suite System Monitoring:
Celsius
Watts
Performance Per Watts:
Performance Per Watts

TITAN RTX

Testing initiated at 6 September 2020 14:22 by user pts.

GTX 1070 Ti

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: Zotac NVIDIA GeForce GTX 1070 Ti 8GB (1607/4006MHz), Audio: NVIDIA GP104 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 2432
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Testing initiated at 6 September 2020 17:24 by user pts.

GTX 970

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: eVGA NVIDIA GeForce GTX 970 4GB (1163/3505MHz), Audio: NVIDIA GM204 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 1664
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Testing initiated at 6 September 2020 19:52 by user pts.

GTX 1660 SUPER

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: eVGA NVIDIA GeForce GTX 1660 SUPER 6GB (1530/7000MHz), Audio: NVIDIA TU116 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 1408
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Testing initiated at 7 September 2020 06:29 by user pts.

GTX 980

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: NVIDIA GeForce GTX 980 4GB (1126/3505MHz), Audio: NVIDIA GM204 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 2048
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Testing initiated at 7 September 2020 09:43 by user pts.

GTX 1070

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: NVIDIA GeForce GTX 1070 8GB (1506/4006MHz), Audio: NVIDIA GP104 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 1920
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Testing initiated at 7 September 2020 11:50 by user pts.

GTX 1080

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: NVIDIA GeForce GTX 1080 8GB (1607/5005MHz), Audio: NVIDIA GP104 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 2560
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Testing initiated at 7 September 2020 14:05 by user pts.

GTX 980 Ti

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: NVIDIA GeForce GTX 980 Ti 6GB (999/3505MHz), Audio: NVIDIA GM200 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 2816
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Testing initiated at 7 September 2020 17:26 by user pts.

RTX 2080 Ti

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz), Audio: NVIDIA TU102 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 4352
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Testing initiated at 7 September 2020 19:56 by user pts.

GTX 1060

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: NVIDIA GeForce GTX 1060 6GB (1506/4006MHz), Audio: NVIDIA GP106 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 1280
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Testing initiated at 8 September 2020 06:32 by user phoronix.

GPU Compute

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

TITAN RTX

GTX 1070 Ti

GTX 970

GTX 1660 SUPER

GTX 980

GTX 1070

GTX 1080

GTX 980 Ti

RTX 2080 Ti

GTX 1060

LeelaChessZero

GROMACS

OctaneBench

FAHBench

LuxCoreRender OpenCL

Darmstadt Automotive Parallel Heterogeneous Suite

LuxCoreRender OpenCL

Rodinia

PlaidML

LuxCoreRender OpenCL

Rodinia

PlaidML

Darmstadt Automotive Parallel Heterogeneous Suite

PlaidML

clpeak

PlaidML

Darmstadt Automotive Parallel Heterogeneous Suite

ArrayFire

Darmstadt Automotive Parallel Heterogeneous Suite

NAMD CUDA

PlaidML

Darmstadt Automotive Parallel Heterogeneous Suite

MandelGPU

Rodinia

cl-mem

PlaidML

FinanceBench

Darktable

clpeak

ArrayFire

ViennaCL

Mixbench

clpeak

Mixbench

clpeak

Mixbench

Darktable

NeatBench

Mixbench

FinanceBench

GPU Temperature Monitor

GPU Power Consumption Monitor

Meta Performance Per Watts

54 Results Shown

TITAN RTX

GTX 1070 Ti

GTX 970

GTX 1660 SUPER

GTX 980

GTX 1070

GTX 1080

GTX 980 Ti

RTX 2080 Ti

GTX 1060