GPU Compute

AMD Ryzen 9 3950X 16-Core testing with an ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and NVIDIA GeForce GTX 1060 6GB on Ubuntu 20.04 via the Phoronix Test Suite.

TITAN RTX

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: NVIDIA TITAN RTX 24GB (420/405MHz), Audio: NVIDIA TU102 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

OS: Ubuntu 20.04, Kernel: 5.4.0-45-generic (x86_64), Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Display Driver: NVIDIA 450.66, OpenGL: 4.6.0, OpenCL: OpenCL 1.2 CUDA 11.0.228, Vulkan: 1.2.133, Compiler: GCC 9.3.0 + CUDA 11.0, File-System: ext4, Screen Resolution: 3840x2160

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 4608
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

GTX 1070 Ti

Changed Graphics to Zotac NVIDIA GeForce GTX 1070 Ti 8GB (1607/4006MHz).

OpenCL Change: GPU Compute Cores: 2432

GTX 970

Changed Graphics to eVGA NVIDIA GeForce GTX 970 4GB (1163/3505MHz).

OpenCL Change: GPU Compute Cores: 1664

GTX 1660 SUPER

Changed Graphics to eVGA NVIDIA GeForce GTX 1660 SUPER 6GB (1530/7000MHz).

OpenCL Change: GPU Compute Cores: 1408

GTX 980

Changed Graphics to NVIDIA GeForce GTX 980 4GB (1126/3505MHz).

OpenCL Change: GPU Compute Cores: 2048

GTX 1070

Changed Graphics to NVIDIA GeForce GTX 1070 8GB (1506/4006MHz).

OpenCL Change: GPU Compute Cores: 1920

GTX 1080

Changed Graphics to NVIDIA GeForce GTX 1080 8GB (1607/5005MHz).

OpenCL Change: GPU Compute Cores: 2560

GTX 980 Ti

Changed Graphics to NVIDIA GeForce GTX 980 Ti 6GB (999/3505MHz).

OpenCL Change: GPU Compute Cores: 2816

RTX 2080 Ti

Changed Graphics to NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz).

OpenCL Change: GPU Compute Cores: 4352

GTX 1060

Processor: AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: NVIDIA GeForce GTX 1060 6GB (1506/4006MHz), Audio: NVIDIA GP106 HD Audio, Monitor: DELL P2415Q, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

OS: Ubuntu 20.04, Kernel: 5.4.0-47-generic (x86_64), Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Display Driver: NVIDIA 450.66, OpenGL: 4.6.0, OpenCL: OpenCL 1.2 CUDA 11.0.228, Vulkan: 1.2.133, Compiler: GCC 9.3.0 + CUDA 11.0, File-System: ext4, Screen Resolution: 3840x2160

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013
OpenCL Notes: GPU Compute Cores: 1280
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

ArrayFire

ArrayFire is a GPU and CPU numeric processing library. This test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.
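The charts for this and the following tests include per-watt figures (GFLOPS Per Watt, GB/s Per Watt). The page does not say how they are derived. The sketch below assumes the common approach of dividing the benchmark result by the mean GPU power draw sampled during the run. The function name `perf_per_watt` and the sample numbers are hypothetical and are not taken from these results.

```python
from statistics import mean


def perf_per_watt(result, power_samples_w):
    """Divide a benchmark result by the mean of the sampled power draw (watts).

    Assumption: efficiency = result / average power over the run.
    """
    if not power_samples_w:
        raise ValueError("need at least one power sample")
    avg_power = mean(power_samples_w)
    if avg_power <= 0:
        raise ValueError("average power must be positive")
    return result / avg_power


# Hypothetical inputs for illustration only: a 1000 GFLOPS result and
# three GPU power readings taken while the test ran.
example = perf_per_watt(1000.0, [95.0, 100.0, 105.0])
print(f"{example:.2f} GFLOPS per watt")
```

Averaging the samples first means a short power spike has little effect on the figure, while a card that holds a high draw for the whole run is penalized accordingly.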

[Charts: Result, GFLOPS Per Watt, GPU Power Consumption, GPU Temp]

[Charts: Result, GPU Power Consumption, GPU Temp]

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

[Charts: Result, GB/s Per Watt, GPU Power Consumption, GPU Temp]

[Charts: Result, GB/s Per Watt, GPU Power Consumption, GPU Temp]

[Charts: Result, GB/s Per Watt, GPU Power Consumption, GPU Temp]

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.
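For context when reading peak-compute results, a rough theoretical FP32 ceiling can be estimated from the compute core counts and clocks listed in the system details above. The sketch assumes one fused multiply-add (2 FLOPs) per core per clock, and it uses the clocks exactly as reported, which look like base rather than boost clocks, so real sustained peaks may be higher. The TITAN RTX is left out because its reported 420MHz looks like an idle-state reading. The function name `peak_fp32_gflops` is hypothetical.

```python
# Compute core counts and reported core clocks (MHz), copied from the
# system details above.
GPUS = {
    "GTX 1070 Ti": (2432, 1607),
    "GTX 1080": (2560, 1607),
    "GTX 980": (2048, 1126),
}


def peak_fp32_gflops(cores, clock_mhz, flops_per_core_per_clock=2):
    """Estimate theoretical single-precision GFLOPS.

    Assumption: each core retires one fused multiply-add (2 FLOPs) per clock.
    """
    return cores * flops_per_core_per_clock * clock_mhz / 1000.0


for name, (cores, clock_mhz) in GPUS.items():
    print(f"{name}: {peak_fp32_gflops(cores, clock_mhz):.1f} GFLOPS (estimate)")
```

These are upper bounds under the stated assumptions, not measurements, and they say nothing about memory bandwidth or integer throughput.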

[Charts: Result, GBPS Per Watt, GPU Power Consumption, GPU Temp]

[Charts: Result, GFLOPS Per Watt, GPU Power Consumption, GPU Temp]

[Charts: Result, GFLOPS Per Watt, GPU Power Consumption, GPU Temp]

[Charts: Result, GIOPS Per Watt, GPU Power Consumption, GPU Temp]

Darktable

[Charts: Result, GPU Power Consumption, GPU Temp]

[Charts: Result, GPU Power Consumption, GPU Temp]

[Charts: Result, GPU Power Consumption, GPU Temp]

[Charts: Result, GPU Power Consumption, GPU Temp]

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite, with OpenCL / CUDA / OpenMP test cases for automotive benchmarks that evaluate programming models in the context of autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

Result