GPU Compute

Benchmarks for a future article.

RTX 3090

Processor: AMD Ryzen 7 5800X3D 8-Core @ 3.40GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (4201 BIOS), Chipset: AMD Starship/Matisse, Memory: 32GB, Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 2000GB Samsung SSD 980 PRO 2TB, Graphics: NVIDIA GeForce RTX 3090 24GB, Audio: NVIDIA GA102 HD Audio, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211

OS: Ubuntu 22.04, Kernel: 5.18.0-051800-generic (x86_64), Desktop: GNOME Shell 42.2, Display Server: X Server 1.21.1.3, Display Driver: NVIDIA 515.49.06, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 11.7.99, Vulkan: 1.2.204, Compiler: GCC 11.2.0, File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa20120a
Graphics Notes: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.27.00.02
OpenCL Notes: GPU Compute Cores: 10496
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

RTX 3080

Changed Graphics to NVIDIA GeForce RTX 3080 10GB.

Graphics Change: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.20.00.07
OpenCL Change: GPU Compute Cores: 8704

RTX 3060 Ti

Changed Graphics to NVIDIA GeForce RTX 3060 Ti 8GB.

Graphics Change: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2c
OpenCL Change: GPU Compute Cores: 4864

RTX 3080 Ti

Changed Graphics to NVIDIA GeForce RTX 3080 Ti 12GB.

Graphics Change: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.71.00.01
OpenCL Change: GPU Compute Cores: 10240

RTX 3060

Changed Graphics to eVGA NVIDIA GeForce RTX 3060 12GB.

Graphics Change: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.06.14.40.46
OpenCL Change: GPU Compute Cores: 3584

RTX 3070

Changed Graphics to NVIDIA GeForce RTX 3070 8GB.

Graphics Change: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2b
OpenCL Change: GPU Compute Cores: 5888

RTX 3070 Ti

Changed Graphics to NVIDIA GeForce RTX 3070 Ti 8GB.

Graphics Change: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.5b.00.02
OpenCL Change: GPU Compute Cores: 6144

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.
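The per-watt figures that accompany results on this page relate a benchmark score to the GPU power draw monitored during the run. A minimal Python sketch of that arithmetic follows, assuming the per-watt value is the score divided by the mean of the sampled power readings. The function name and the sample numbers are hypothetical and are not taken from these results.

```python
from statistics import mean


def perf_per_watt(score, power_samples_watts):
    """Divide a benchmark score by the average of the sampled power draw."""
    if not power_samples_watts:
        raise ValueError("need at least one power sample")
    avg_watts = mean(power_samples_watts)
    if avg_watts <= 0:
        raise ValueError("average power must be positive")
    return score / avg_watts


# Hypothetical score (e.g. GFLOPS) and power readings in watts, for illustration only.
example = perf_per_watt(30000.0, [290.0, 300.0, 310.0])
print(round(example, 2))
```

Averaging over the whole run means a card that briefly spikes in power is not penalized as heavily as one that draws high power throughout.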

[Graphs: Result, GFLOPS Per Watt, GPU Power Consumption, GPU Temp]

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.
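GEMM throughput is conventionally derived from the operation count of a dense matrix product, 2·M·N·K floating-point operations, divided by the elapsed time. The sketch below illustrates that calculation with a naive pure-Python multiply on the CPU. It is not ViennaCL's benchmark code, and the helper names and the matrix size are hypothetical.

```python
import time


def gemm_flops(m, n, k):
    """Floating-point operations in C = A @ B for an (m x k) by (k x n) product."""
    return 2 * m * n * k  # one multiply and one add per inner-loop step


def matmul(a, b):
    """Naive dense matrix multiply on lists of lists."""
    k, n = len(b), len(b[0])
    return [[sum(row[p] * b[p][j] for p in range(k)) for j in range(n)] for row in a]


def measure_gflops(size):
    """Time one square multiply and convert the operation count to GFLOP/s."""
    a = [[1.0] * size for _ in range(size)]
    b = [[2.0] * size for _ in range(size)]
    start = time.perf_counter()
    c = matmul(a, b)
    elapsed = time.perf_counter() - start
    return c, gemm_flops(size, size, size) / elapsed / 1e9


c, gflops = measure_gflops(64)
print(f"{gflops:.4f} GFLOP/s (pure Python, CPU)")
```

The absolute number from this sketch is orders of magnitude below a GPU BLAS result; only the way the figure is computed carries over.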

[Graphs: Result, GFLOPs/s Per Watt]

[Graphs: Result, GFLOPs/s Per Watt]

SHOC Scalable HeterOgeneous Computing

This is the CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing (SHOC) benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, GFLOPS Per Watt, GPU Power Consumption, GPU Temp]

[Graphs: Result, GHash/s Per Watt, GPU Power Consumption, GPU Temp]

[Graphs: Result, GFLOPS Per Watt, GPU Power Consumption, GPU Temp]

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.
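The Hashcat results are expressed as a hash rate (H/s): the number of candidate hashes computed per second. As a rough illustration of what that unit measures, the sketch below times MD5 over a batch of made-up candidates on a single CPU thread using Python's standard hashlib. It is not how Hashcat itself benchmarks, and the function name and candidate strings are hypothetical.

```python
import hashlib
import time


def md5_hash_rate(candidates):
    """Hash each candidate with MD5 and return (digests, hashes per second)."""
    start = time.perf_counter()
    digests = [hashlib.md5(c).hexdigest() for c in candidates]
    elapsed = time.perf_counter() - start
    return digests, len(candidates) / elapsed


# Made-up candidate passwords, for illustration only.
candidates = [f"password{i}".encode() for i in range(100_000)]
digests, rate = md5_hash_rate(candidates)
print(f"{rate:,.0f} H/s on one CPU thread")
```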

[Graphs: Result, H/s Per Watt, GPU Power Consumption, GPU Temp]

[Graphs: Result, H/s Per Watt, GPU Power Consumption, GPU Temp]

[Graphs: Result, H/s Per Watt, GPU Power Consumption, GPU Temp]

clpeak

[Graphs: Result, GIOPS Per Watt, GPU Power Consumption, GPU Temp]

Hashcat

[Graphs: Result, H/s Per Watt, GPU Power Consumption, GPU Temp]

clpeak

[Graphs: Result, GFLOPS Per Watt, GPU Power Consumption, GPU Temp]

SHOC Scalable HeterOgeneous Computing

[Graphs: Result, GFLOPS Per Watt, GPU Power Consumption, GPU Temp]

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, M samples/sec Per Watt, GPU Power Consumption, GPU Temp]

Hashcat

[Graphs: Result, H/s Per Watt, GPU Power Consumption, GPU Temp]

LuxCoreRender

[Graphs: Result, M samples/sec Per Watt, GPU Power Consumption, GPU Temp]

[Graphs: Result, M samples/sec Per Watt, GPU Power Consumption, GPU Temp]

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Scholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.
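For readers unfamiliar with the first of those workloads, the following is a minimal sketch of the closed-form Black-Scholes price of a European option, assuming no dividends. It is a textbook formula written in plain Python, not FinanceBench's OpenCL kernel, and the function names and input values are chosen here for illustration.

```python
from math import erf, exp, log, sqrt


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + erf(x / sqrt(2.0)))


def bs_european(spot, strike, rate, vol, t, kind="call"):
    """Black-Scholes price of a European call or put (no dividends)."""
    d1 = (log(spot / strike) + (rate + 0.5 * vol ** 2) * t) / (vol * sqrt(t))
    d2 = d1 - vol * sqrt(t)
    discounted_strike = strike * exp(-rate * t)
    if kind == "call":
        return spot * norm_cdf(d1) - discounted_strike * norm_cdf(d2)
    if kind == "put":
        return discounted_strike * norm_cdf(-d2) - spot * norm_cdf(-d1)
    raise ValueError("kind must be 'call' or 'put'")


# Illustrative inputs: spot 100, strike 100, 5% rate, 20% volatility, 1 year.
call = bs_european(100.0, 100.0, 0.05, 0.20, 1.0, "call")
put = bs_european(100.0, 100.0, 0.05, 0.20, 1.0, "put")
print(f"call={call:.4f} put={put:.4f}")
```

Each option is priced independently of every other, which is why this kind of workload maps well onto thousands of GPU threads.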

[Graphs: Result, GPU Power Consumption, GPU Temp]

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, GPU Power Consumption, GPU Temp]

clpeak

[Graphs: Result, GBPS Per Watt, GPU Power Consumption, GPU Temp]

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, GB/s Per Watt, GPU Power Consumption, GPU Temp]

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, vrays Per Watt, GPU Power Consumption, GPU Temp]

SHOC Scalable HeterOgeneous Computing

[Graphs: Result, GFLOPS Per Watt, GPU Power Consumption, GPU Temp]

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, GPU Power Consumption, GPU Temp]

Chaos Group V-RAY

[Graphs: Result, vpaths Per Watt, GPU Power Consumption, GPU Temp]

cl-mem

[Graphs: Result, GB/s Per Watt, GPU Power Consumption, GPU Temp]

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, M samples/s Per Watt, GPU Power Consumption, GPU Temp]

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, Nodes Per Second Per Watt, GPU Power Consumption, GPU Temp]

RealSR-NCNN

RealSR-NCNN is an NCNN neural network implementation of the RealSR project and accelerated using the Vulkan API. RealSR is the Real-World Super Resolution via Kernel Estimation and Noise Injection. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image by a scale of 4x with Vulkan. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, GPU Power Consumption, GPU Temp]

OctaneBench

OctaneBench is a test of the OctaneRender on the GPU and requires the use of NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, Score Per Watt, GPU Power Consumption, GPU Temp]

LuxCoreRender

[Graphs: Result, M samples/sec Per Watt, GPU Power Consumption, GPU Temp]

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.
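The iteration cap mentioned above is the upper bound of the Mandelbrot escape-time loop run for each sample. A scalar CPU sketch of that loop is shown below, using the 4096-iteration maximum from the description. It is not MandelGPU's float4 OpenCL kernel, and the function names, viewport bounds, and image size are hypothetical.

```python
def mandel_iterations(c, max_iter=4096):
    """Return how many iterations z = z*z + c takes to escape |z| > 2, capped at max_iter."""
    z = 0j
    for i in range(max_iter):
        z = z * z + c
        if abs(z) > 2.0:
            return i + 1
    return max_iter  # treated as inside the set


def render(width, height, max_iter=4096):
    """Evaluate the escape-time count on a width x height grid (both >= 2)."""
    rows = []
    for y in range(height):
        im = -1.25 + 2.5 * y / (height - 1)
        row = []
        for x in range(width):
            re = -2.0 + 3.0 * x / (width - 1)
            row.append(mandel_iterations(complex(re, im), max_iter))
        rows.append(row)
    return rows


# Small grid with a reduced cap so the pure-Python loop finishes quickly.
for row in render(8, 4, max_iter=64):
    print(row)
```

Points inside the set always run to the cap, so a higher maximum increases the work per sample and the benefit of evaluating many samples in parallel.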

[Graphs: Result, Samples/sec Per Watt, Power Consumption, Temp]

ViennaCL

Meta Performance Per Watts

LuxCoreRender

[Graphs: Result, M samples/sec Per Watt, GPU Power Consumption, GPU Temp]

IndigoBench

[Graphs: Result, M samples/s Per Watt, GPU Power Consumption, GPU Temp]

ViennaCL

FinanceBench

[Graphs: Result, GPU Power Consumption, GPU Temp]

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, Ns Per Day Per Watt, GPU Power Consumption, GPU Temp]

RealSR-NCNN

[Graphs: Result, GPU Power Consumption, GPU Temp]

SHOC Scalable HeterOgeneous Computing

[Graphs: Result, GB/s Per Watt, GPU Power Consumption, GPU Temp]

Darktable

Darktable is an open-source photography / workflow application. This test will use any system-installed Darktable program or, on Windows, will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, GPU Power Consumption, GPU Temp]

ViennaCL

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, GPU Power Consumption, GPU Temp]

cl-mem

[Graphs: Result, GB/s Per Watt, GPU Power Consumption, GPU Temp]

ViennaCL

SHOC Scalable HeterOgeneous Computing

[Graphs: Result, GB/s Per Watt, GPU Power Consumption, GPU Temp]

Darktable

[Graphs: Result, GPU Power Consumption, GPU Temp]

ViennaCL

JuliaGPU

JuliaGPU is an OpenCL benchmark with this version containing various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, Samples/sec Per Watt, Power Consumption, Temp]

ViennaCL

[Graphs: Result, GFLOPs/s Per Watt, GPU Power Consumption, GPU Temp]

Darktable

[Graphs: Result, GPU Power Consumption, GPU Temp]

[Graphs: Result, GPU Power Consumption, GPU Temp]

SHOC Scalable HeterOgeneous Computing

[Graphs: Result, GB/s Per Watt, GPU Power Consumption, GPU Temp]

Rodinia

[Graphs: Result, GPU Power Consumption, GPU Temp]

clpeak

[Graphs: Result, GBPS Per Watt, GPU Power Consumption, GPU Temp]

[Graphs: Result, GPU Power Consumption, GPU Temp]

[Graphs: Result, GBPS Per Watt, GPU Power Consumption, GPU Temp]

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, GIOPS Per Watt, GPU Power Consumption, GPU Temp]

SHOC Scalable HeterOgeneous Computing

[Graphs: Result, GB/s Per Watt, GPU Power Consumption, GPU Temp]

[Graphs: Result, GB/s Per Watt, GPU Power Consumption, GPU Temp]

SmallPT GPU

SmallPT GPU is an OpenCL benchmark that's run with various PTS changes compared to upstream and multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

[Graphs: Result, Samples/sec Per Watt, GPU Power Consumption, GPU Temp]

[Graphs: Result, Samples/sec Per Watt, GPU Power Consumption, GPU Temp]

[Graphs: Result, Samples/sec Per Watt, GPU Power Consumption, GPU Temp]

GPU Temperature Monitor

GPU Power Consumption Monitor

79 Results Shown

clpeak
ViennaCL:
  OpenCL BLAS - dGEMM-NT
  OpenCL BLAS - dGEMM-TN
  OpenCL BLAS - dGEMM-NN
SHOC Scalable HeterOgeneous Computing:
  OpenCL - GEMM SGEMM_N
  OpenCL - MD5 Hash
  OpenCL - Max SP Flops
Hashcat:
  SHA1
  SHA-512
  MD5
clpeak
Hashcat
clpeak
SHOC Scalable HeterOgeneous Computing
LuxCoreRender
Hashcat
LuxCoreRender:
  LuxCore Benchmark - GPU
  DLSC - GPU
FinanceBench
Rodinia
clpeak
cl-mem
Chaos Group V-RAY
SHOC Scalable HeterOgeneous Computing
VkResample
Chaos Group V-RAY
cl-mem
IndigoBench
LeelaChessZero
RealSR-NCNN
OctaneBench
LuxCoreRender
MandelGPU
ViennaCL
Meta Performance Per Watts
LuxCoreRender
IndigoBench
ViennaCL:
  OpenCL BLAS - dDOT
  OpenCL BLAS - dCOPY
FinanceBench
FAHBench
RealSR-NCNN
SHOC Scalable HeterOgeneous Computing
Darktable
ViennaCL
Waifu2x-NCNN Vulkan
cl-mem
ViennaCL:
  OpenCL BLAS - sCOPY
  OpenCL BLAS - sDOT
SHOC Scalable HeterOgeneous Computing
Darktable
ViennaCL
JuliaGPU
ViennaCL:
  OpenCL BLAS - dGEMM-TT
  OpenCL BLAS - dGEMV-N
Darktable:
  Server Rack - OpenCL
  Masskrug - OpenCL
SHOC Scalable HeterOgeneous Computing
Rodinia
clpeak:
  Transfer Bandwidth enqueueReadBuffer
  Kernel Latency
  Transfer Bandwidth enqueueWriteBuffer
vkpeak:
  fp32-vec4
  fp64-vec4
  fp64-scalar
  int16-vec4
  int32-scalar
  fp16-vec4
  int32-vec4
  int16-scalar
  fp32-scalar
  fp16-scalar
SHOC Scalable HeterOgeneous Computing:
  OpenCL - Bus Speed Download
  OpenCL - Bus Speed Readback
SmallPT GPU:
  GPU - Complex
  GPU - Cornell
  GPU - Caustic3
GPU Temperature Monitor:
  Phoronix Test Suite System Monitoring:
    Celsius
    Watts

RTX 3090

Testing initiated at 10 July 2022 20:50 by user phoronix.

RTX 3080

System details and notes are identical to the RTX 3090 configuration listed above, except Graphics: NVIDIA GeForce RTX 3080 10GB (BAR1 / Visible vRAM Size: 16384 MiB, vBIOS Version: 94.02.20.00.07, GPU Compute Cores: 8704).

Testing initiated at 15 July 2022 18:28 by user phoronix.

RTX 3060 Ti

System details and notes are identical to the RTX 3090 configuration listed above, except Graphics: NVIDIA GeForce RTX 3060 Ti 8GB (BAR1 / Visible vRAM Size: 8192 MiB, vBIOS Version: 94.04.25.00.2c, GPU Compute Cores: 4864) and Audio: NVIDIA GA104 HD Audio.

Testing initiated at 16 July 2022 04:59 by user phoronix.

RTX 3080 Ti

System details and notes are identical to the RTX 3090 configuration listed above, except Graphics: NVIDIA GeForce RTX 3080 Ti 12GB (BAR1 / Visible vRAM Size: 16384 MiB, vBIOS Version: 94.02.71.00.01, GPU Compute Cores: 10240).

Testing initiated at 16 July 2022 08:46 by user phoronix.

RTX 3060

System details and notes are identical to the RTX 3090 configuration listed above, except Graphics: eVGA NVIDIA GeForce RTX 3060 12GB (BAR1 / Visible vRAM Size: 16384 MiB, vBIOS Version: 94.06.14.40.46, GPU Compute Cores: 3584) and Audio: NVIDIA GA106 HD Audio.

Testing initiated at 16 July 2022 12:28 by user phoronix.

RTX 3070

System details and notes are identical to the RTX 3090 configuration listed above, except Graphics: NVIDIA GeForce RTX 3070 8GB (BAR1 / Visible vRAM Size: 8192 MiB, vBIOS Version: 94.04.25.00.2b, GPU Compute Cores: 5888) and Audio: NVIDIA GA104 HD Audio.

Testing initiated at 16 July 2022 20:55 by user phoronix.

RTX 3070 Ti

System details and notes are identical to the RTX 3090 configuration listed above, except Graphics: NVIDIA GeForce RTX 3070 Ti 8GB (BAR1 / Visible vRAM Size: 8192 MiB, vBIOS Version: 94.04.5b.00.02, GPU Compute Cores: 6144) and Audio: NVIDIA GA104 HD Audio.

Testing initiated at 17 July 2022 06:31 by user phoronix.
