GPU Compute

AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and NVIDIA GeForce GTX 1060 6GB on Ubuntu 20.04 via the Phoronix Test Suite.

TITAN RTX

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: NVIDIA TITAN RTX 24GB (420/405MHz), Audio: NVIDIA TU102 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

OS: Ubuntu 20.04, Kernel: 5.4.0-45-generic (x86_64), Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Display Driver: NVIDIA 450.66, OpenGL: 4.6.0, OpenCL: OpenCL 1.2 CUDA 11.0.228, Vulkan: 1.2.133, Compiler: GCC 9.3.0 + CUDA 11.0, File-System: ext4, Screen Resolution: 3840x2160

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 4608
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

GTX 1070 Ti

Changed Graphics to Zotac NVIDIA GeForce GTX 1070 Ti 8GB (1607/4006MHz).

OpenCL Change: GPU Compute Cores: 2432

GTX 970

Changed Graphics to eVGA NVIDIA GeForce GTX 970 4GB (1163/3505MHz).

OpenCL Change: GPU Compute Cores: 1664

GTX 1660 SUPER

Changed Graphics to eVGA NVIDIA GeForce GTX 1660 SUPER 6GB (1530/7000MHz).

OpenCL Change: GPU Compute Cores: 1408

GTX 980

Changed Graphics to NVIDIA GeForce GTX 980 4GB (1126/3505MHz).

OpenCL Change: GPU Compute Cores: 2048

GTX 1070

Changed Graphics to NVIDIA GeForce GTX 1070 8GB (1506/4006MHz).

OpenCL Change: GPU Compute Cores: 1920

GTX 1080

Changed Graphics to NVIDIA GeForce GTX 1080 8GB (1607/5005MHz).

OpenCL Change: GPU Compute Cores: 2560

GTX 980 Ti

Changed Graphics to NVIDIA GeForce GTX 980 Ti 6GB (999/3505MHz).

OpenCL Change: GPU Compute Cores: 2816

RTX 2080 Ti

Changed Graphics to NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz).

OpenCL Change: GPU Compute Cores: 4352

GTX 1060

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: NVIDIA GeForce GTX 1060 6GB (1506/4006MHz), Audio: NVIDIA GP106 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

OS: Ubuntu 20.04, Kernel: 5.4.0-47-generic (x86_64), Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Display Driver: NVIDIA 450.66, OpenGL: 4.6.0, OpenCL: OpenCL 1.2 CUDA 11.0.228, Vulkan: 1.2.133, Compiler: GCC 9.3.0 + CUDA 11.0, File-System: ext4, Screen Resolution: 3840x2160

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 1280
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

Result

Ns Per Day Per Watt

GPU Power Consumption

GPU Temp

GROMACS

The CUDA version of the Gromacs molecular dynamics package. Learn more via the OpenBenchmarking.org test page.

Result

Ns Per Day Per Watt

GPU Power Consumption

GPU Temp

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GIOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GIOPS Per Watt

GPU Power Consumption

GPU Temp

OctaneBench

OctaneBench is a test of the OctaneRender on the GPU and requires the use of NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

Result

Score Per Watt

GPU Power Consumption

GPU Temp

LuxCoreRender OpenCL

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on OpenCL accelerators/GPUs. The alternative luxcorerender test profile is for CPU execution due to a difference in tests, etc. Learn more via the OpenBenchmarking.org test page.

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

Result

GPU Power Consumption

GPU Temp

Result

GPU Power Consumption

GPU Temp

ArrayFire

ArrayFire is an GPU and CPU numeric processing library, this test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GPU Power Consumption

GPU Temp

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

Result

GBPS Per Watt

GPU Power Consumption

GPU Temp

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GIOPS Per Watt

GPU Power Consumption

GPU Temp

NeatBench

NeatBench is a benchmark of the cross-platform Neat Video software on the CPU and optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.

Result

FPS Per Watt

Power Consumption

Temp

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

Result

GPU Power Consumption

GPU Temp

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

Result

FPS Per Watt

GPU Power Consumption

GPU Temp

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

Result

Nodes Per Second Per Watt

GPU Power Consumption

GPU Temp

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

Result