GPU Compute

Benchmarks for a future article.

RTX 3090

Processor: AMD Ryzen 7 5800X3D 8-Core @ 3.40GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (4201 BIOS), Chipset: AMD Starship/Matisse, Memory: 32GB, Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 2000GB Samsung SSD 980 PRO 2TB, Graphics: NVIDIA GeForce RTX 3090 24GB, Audio: NVIDIA GA102 HD Audio, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211

OS: Ubuntu 22.04, Kernel: 5.18.0-051800-generic (x86_64), Desktop: GNOME Shell 42.2, Display Server: X Server 1.21.1.3, Display Driver: NVIDIA 515.49.06, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 11.7.99, Vulkan: 1.2.204, Compiler: GCC 11.2.0, File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa20120a
Graphics Notes: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.27.00.02
OpenCL Notes: GPU Compute Cores: 10496
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

RTX 3080

Changed Graphics to NVIDIA GeForce RTX 3080 10GB.

Graphics Change: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.20.00.07
OpenCL Change: GPU Compute Cores: 8704

RTX 3060 Ti

Changed Graphics to NVIDIA GeForce RTX 3060 Ti 8GB.

Graphics Change: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2c
OpenCL Change: GPU Compute Cores: 4864

RTX 3080 Ti

Changed Graphics to NVIDIA GeForce RTX 3080 Ti 12GB.

Graphics Change: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.71.00.01
OpenCL Change: GPU Compute Cores: 10240

RTX 3060

Changed Graphics to eVGA NVIDIA GeForce RTX 3060 12GB.

Graphics Change: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.06.14.40.46
OpenCL Change: GPU Compute Cores: 3584

RTX 3070

Changed Graphics to NVIDIA GeForce RTX 3070 8GB.

Graphics Change: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2b
OpenCL Change: GPU Compute Cores: 5888

RTX 3070 Ti

Changed Graphics to NVIDIA GeForce RTX 3070 Ti 8GB.

Graphics Change: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.5b.00.02
OpenCL Change: GPU Compute Cores: 6144

SHOC Scalable HeterOgeneous Computing

This is the CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp
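
The "Per Watt" rows on this page relate each benchmark result to GPU power draw. A minimal sketch of that arithmetic follows, assuming the efficiency figure is the result divided by the mean of the power samples recorded during the run. The function name and the sample values are hypothetical and are not measurements from these runs.

```python
from statistics import mean


def performance_per_watt(result, power_samples_watts):
    """Divide a higher-is-better result by the mean sampled power draw."""
    if not power_samples_watts:
        raise ValueError("need at least one power sample")
    return result / mean(power_samples_watts)


# Hypothetical values for illustration only, not data from this result file.
example_gflops = 9000.0
example_power = [290.0, 300.0, 310.0]
print(performance_per_watt(example_gflops, example_power))
```

For lower-is-better results (for example, run times in seconds) a simple division like this would not be meaningful, which may be why some tests on this page list no "Per Watt" row.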

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

Result

Nodes Per Second Per Watt

GPU Power Consumption

GPU Temp

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak measures Vulkan compute performance for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 operations. Learn more via the OpenBenchmarking.org test page.

Result

GIOPS Per Watt

GPU Power Consumption

GPU Temp

OctaneBench

OctaneBench tests OctaneRender on the GPU and requires NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

Result

Score Per Watt

GPU Power Consumption

GPU Temp

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

Result

Ns Per Day Per Watt

GPU Power Consumption

GPU Temp

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

Result

vrays Per Watt

GPU Power Consumption

GPU Temp

Result

vpaths Per Watt

GPU Power Consumption

GPU Temp

LuxCoreRender

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

Result

M samples/s Per Watt

GPU Power Consumption

GPU Temp

Result

M samples/s Per Watt

GPU Power Consumption

GPU Temp

RealSR-NCNN

RealSR-NCNN is an NCNN neural network implementation of the RealSR project and accelerated using the Vulkan API. RealSR is the Real-World Super Resolution via Kernel Estimation and Noise Injection. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image by a scale of 4x with Vulkan. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

SmallPT GPU

SmallPT GPU is an OpenCL benchmark, run here with various PTS changes relative to upstream; multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

Result

Samples/sec Per Watt

GPU Power Consumption

GPU Temp

Result

Samples/sec Per Watt

GPU Power Consumption

GPU Temp

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

SmallPT GPU

Result

Samples/sec Per Watt

GPU Power Consumption

GPU Temp

clpeak

Result

GBPS Per Watt

GPU Power Consumption

GPU Temp

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.
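
The ViennaCL sub-tests in this result file are named after standard BLAS routines (AXPY, DOT, GEMV, GEMM). As a rough illustration of what two of them compute and how a GFLOP/s figure can be derived from a timing, here is a plain-Python sketch. It does not use ViennaCL or OpenCL; the function names and the convention of counting 2n floating-point operations for a length-n DOT are assumptions made for illustration.

```python
import time


def axpy(alpha, x, y):
    """AXPY: return alpha * x + y, element by element."""
    return [alpha * xi + yi for xi, yi in zip(x, y)]


def dot(x, y):
    """DOT: return the inner product of x and y."""
    return sum(xi * yi for xi, yi in zip(x, y))


def gflops(flop_count, seconds):
    """Convert an operation count and a wall time into GFLOP/s."""
    return flop_count / seconds / 1e9


n = 100_000
x = [1.0] * n
y = [2.0] * n

start = time.perf_counter()
result = dot(x, y)
elapsed = time.perf_counter() - start

# One multiply and one add per element, so 2 * n operations (assumed convention).
print(result, gflops(2 * n, elapsed))
```

The throughput printed here reflects the Python interpreter on the CPU and says nothing about the GPUs tested on this page; only the arithmetic is the point.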

Result

GFLOPs/s Per Watt

GPU Power Consumption

GPU Temp

Result

GFLOPs/s Per Watt

Result

GFLOPs/s Per Watt

SHOC Scalable HeterOgeneous Computing

Result

GB/s Per Watt

GPU Power Consumption

GPU Temp

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

LuxCoreRender

Result

M samples/sec Per Watt

GPU Power Consumption

GPU Temp

clpeak

Result

GBPS Per Watt

GPU Power Consumption

GPU Temp

RealSR-NCNN

Result

GPU Power Consumption

GPU Temp

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases cover the Black-Scholes-Merton process with an analytic European option engine, the QMC (Sobol) Monte-Carlo method (equity option example), a fixed-rate bond with a flat forward curve, and a securities repurchase agreement (repo). FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.
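
To make the Black-Scholes-Merton test case more concrete, the sketch below prices a European call with the textbook closed-form formula (no dividends). It is not taken from FinanceBench; the function and parameter names are hypothetical, and the inputs are a commonly used illustrative example rather than the benchmark's data set.

```python
import math


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def black_scholes_call(spot, strike, rate, vol, years):
    """Closed-form price of a European call on a non-dividend-paying asset."""
    sqrt_t = math.sqrt(years)
    d1 = (math.log(spot / strike) + (rate + 0.5 * vol ** 2) * years) / (vol * sqrt_t)
    d2 = d1 - vol * sqrt_t
    return spot * norm_cdf(d1) - strike * math.exp(-rate * years) * norm_cdf(d2)


# Illustrative inputs: at-the-money call, 5% rate, 20% volatility, one year.
price = black_scholes_call(spot=100.0, strike=100.0, rate=0.05, vol=0.2, years=1.0)
print(round(price, 4))
```

Each option is priced independently of the others, which is what makes this kind of workload a natural fit for GPU parallelism.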

Result

GPU Power Consumption

GPU Temp

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.
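
The hash algorithms exercised in this result file are MD5, SHA1 and SHA-512. As a CPU-only illustration of the kind of work being timed (hashing candidate strings and comparing digests), the sketch below uses Python's standard hashlib module. It is unrelated to Hashcat's own kernels, and the helper name and candidate list are hypothetical.

```python
import hashlib


def find_match(target_hex, candidates, algorithm="md5"):
    """Return the first candidate whose digest equals target_hex, else None."""
    for candidate in candidates:
        digest = hashlib.new(algorithm, candidate.encode("utf-8")).hexdigest()
        if digest == target_hex:
            return candidate
    return None


# Hypothetical word list; the target is derived from one of its entries.
target = hashlib.md5(b"password").hexdigest()
print(find_match(target, ["letmein", "password", "hunter2"]))
```

The H/s figures reported for Hashcat count how many such candidate hashes are evaluated per second.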

Result

H/s Per Watt

GPU Power Consumption

GPU Temp

Result

H/s Per Watt

GPU Power Consumption

GPU Temp

Result

H/s Per Watt

GPU Power Consumption

GPU Temp

Rodinia

Result

GPU Power Consumption

GPU Temp

JuliaGPU

JuliaGPU is an OpenCL benchmark; this version contains various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

Result

Samples/sec Per Watt

Power Consumption

Temp

Hashcat

Result

H/s Per Watt

GPU Power Consumption

GPU Temp

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.
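
The cl-mem results are reported in GB/s for copy, read and write operations. A minimal sketch of how such a figure can be derived from a byte count and a timing is shown below. It times a host-memory copy in Python rather than an OpenCL buffer operation, and counting each copied byte once is an assumed convention; the function name is hypothetical.

```python
import time


def bandwidth_gb_per_s(num_bytes, seconds):
    """Convert bytes moved and elapsed seconds into GB/s (1 GB = 1e9 bytes)."""
    return num_bytes / seconds / 1e9


size = 64 * 1024 * 1024  # 64 MiB test buffer
src = bytearray(size)

start = time.perf_counter()
dst = bytes(src)  # one full copy of the buffer in host memory
elapsed = time.perf_counter() - start

print(f"{bandwidth_gb_per_s(size, elapsed):.2f} GB/s (host copy)")
```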

Result

GB/s Per Watt

GPU Power Consumption

GPU Temp

Result

GB/s Per Watt

GPU Power Consumption

GPU Temp

Result

GB/s Per Watt

GPU Power Consumption

GPU Temp

Hashcat

Result

H/s Per Watt

GPU Power Consumption

GPU Temp

MandelGPU

MandelGPU is an OpenCL benchmark; this test runs the OpenCL float4 rendering kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.
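
For readers unfamiliar with what the iteration limit means, here is a scalar Python sketch of the standard Mandelbrot escape-time loop with the same cap of 4096 iterations. It is not MandelGPU's kernel, which runs in OpenCL on float4 vectors; the function name is hypothetical.

```python
def mandel_iterations(cx, cy, max_iter=4096):
    """Count iterations of z -> z^2 + c until |z| > 2, capped at max_iter."""
    x = y = 0.0
    for i in range(max_iter):
        if x * x + y * y > 4.0:
            return i
        x, y = x * x - y * y + cx, 2.0 * x * y + cy
    return max_iter


# A point inside the set runs to the cap; a distant point escapes almost at once.
print(mandel_iterations(0.0, 0.0), mandel_iterations(2.0, 2.0))
```

Points inside the set cost the full 4096 iterations each, so the amount of work per pixel varies widely across the image.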

Result

Samples/sec Per Watt

Power Consumption

Temp

Darktable

Darktable is an open-source photography workflow application. This test uses any system-installed Darktable program or, on Windows, automatically downloads the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

Result

GPU Power Consumption

GPU Temp

SHOC Scalable HeterOgeneous Computing

Result

GB/s Per Watt

GPU Power Consumption

GPU Temp

Darktable

Result

GPU Power Consumption

GPU Temp

SHOC Scalable HeterOgeneous Computing

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Darktable

Result

GPU Power Consumption

GPU Temp

SHOC Scalable HeterOgeneous Computing

Result

GB/s Per Watt

GPU Power Consumption

GPU Temp

clpeak

Result

GBPS Per Watt

GPU Power Consumption

GPU Temp

Result

GIOPS Per Watt

GPU Power Consumption

GPU Temp

SHOC Scalable HeterOgeneous Computing

Result

GB/s Per Watt

GPU Power Consumption

GPU Temp

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

Result

GHash/s Per Watt

GPU Power Consumption

GPU Temp

Result

GB/s Per Watt

GPU Power Consumption

GPU Temp

Darktable

Result

GPU Power Consumption

GPU Temp

clpeak

Result

GFLOPS Per Watt

GPU Power Consumption

GPU Temp

FinanceBench

Result

GPU Power Consumption

GPU Temp

clpeak

Result

GPU Power Consumption

GPU Temp

GPU Temperature Monitor

GPU Power Consumption Monitor

Meta Performance Per Watts

79 Results Shown

SHOC Scalable HeterOgeneous Computing
LeelaChessZero
vkpeak:
int16-vec4
int16-scalar
int32-vec4
int32-scalar
fp64-vec4
fp64-scalar
fp16-vec4
fp16-scalar
fp32-vec4
fp32-scalar
OctaneBench
FAHBench
LuxCoreRender
Chaos Group V-RAY:
NVIDIA RTX GPU
NVIDIA CUDA GPU
LuxCoreRender:
LuxCore Benchmark - GPU
DLSC - GPU
Orange Juice - GPU
IndigoBench:
OpenCL GPU - Bedroom
OpenCL GPU - Supercar
RealSR-NCNN
SmallPT GPU:
GPU - Caustic3
GPU - Cornell
clpeak
Rodinia
SmallPT GPU
clpeak
ViennaCL:
OpenCL BLAS - dGEMM-TT
OpenCL BLAS - sDOT
OpenCL BLAS - dGEMM-TN
OpenCL BLAS - sCOPY
OpenCL BLAS - dGEMV-N
OpenCL BLAS - dAXPY
OpenCL BLAS - dGEMV-T
OpenCL BLAS - dGEMM-NN
OpenCL BLAS - sAXPY
OpenCL BLAS - dGEMM-NT
OpenCL BLAS - dDOT
OpenCL BLAS - dCOPY
SHOC Scalable HeterOgeneous Computing
VkResample
LuxCoreRender
clpeak
RealSR-NCNN
FinanceBench
Hashcat:
SHA-512
SHA1
MD5
Rodinia
JuliaGPU
Hashcat
Waifu2x-NCNN Vulkan
cl-mem:
Copy
Read
Write
Hashcat
MandelGPU
Darktable
SHOC Scalable HeterOgeneous Computing
Darktable
SHOC Scalable HeterOgeneous Computing
Darktable
SHOC Scalable HeterOgeneous Computing
clpeak:
Global Memory Bandwidth
Integer Compute INT
SHOC Scalable HeterOgeneous Computing:
OpenCL - Reduction
OpenCL - FFT SP
OpenCL - S3D
OpenCL - MD5 Hash
OpenCL - Bus Speed Download
Darktable
clpeak
FinanceBench
clpeak
GPU Temperature Monitor:
Phoronix Test Suite System Monitoring:
Celsius
Watts
Performance Per Watts:
Performance Per Watts

RTX 3090

Testing initiated at 10 July 2022 20:50 by user phoronix.

RTX 3080

Processor: AMD Ryzen 7 5800X3D 8-Core @ 3.40GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (4201 BIOS), Chipset: AMD Starship/Matisse, Memory: 32GB, Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 2000GB Samsung SSD 980 PRO 2TB, Graphics: NVIDIA GeForce RTX 3080 10GB, Audio: NVIDIA GA102 HD Audio, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211

Graphics Notes: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.20.00.07
OpenCL Notes: GPU Compute Cores: 8704

Testing initiated at 15 July 2022 18:28 by user phoronix.

RTX 3060 Ti

Processor: AMD Ryzen 7 5800X3D 8-Core @ 3.40GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (4201 BIOS), Chipset: AMD Starship/Matisse, Memory: 32GB, Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 2000GB Samsung SSD 980 PRO 2TB, Graphics: NVIDIA GeForce RTX 3060 Ti 8GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211

Graphics Notes: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2c
OpenCL Notes: GPU Compute Cores: 4864

Testing initiated at 16 July 2022 04:59 by user phoronix.

RTX 3080 Ti

Processor: AMD Ryzen 7 5800X3D 8-Core @ 3.40GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (4201 BIOS), Chipset: AMD Starship/Matisse, Memory: 32GB, Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 2000GB Samsung SSD 980 PRO 2TB, Graphics: NVIDIA GeForce RTX 3080 Ti 12GB, Audio: NVIDIA GA102 HD Audio, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211

Graphics Notes: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.71.00.01
OpenCL Notes: GPU Compute Cores: 10240

Testing initiated at 16 July 2022 08:46 by user phoronix.

RTX 3060

Processor: AMD Ryzen 7 5800X3D 8-Core @ 3.40GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (4201 BIOS), Chipset: AMD Starship/Matisse, Memory: 32GB, Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 2000GB Samsung SSD 980 PRO 2TB, Graphics: eVGA NVIDIA GeForce RTX 3060 12GB, Audio: NVIDIA GA106 HD Audio, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211

Graphics Notes: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.06.14.40.46
OpenCL Notes: GPU Compute Cores: 3584

Testing initiated at 16 July 2022 12:28 by user phoronix.

RTX 3070

Processor: AMD Ryzen 7 5800X3D 8-Core @ 3.40GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (4201 BIOS), Chipset: AMD Starship/Matisse, Memory: 32GB, Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 2000GB Samsung SSD 980 PRO 2TB, Graphics: NVIDIA GeForce RTX 3070 8GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211

Graphics Notes: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2b
OpenCL Notes: GPU Compute Cores: 5888

Testing initiated at 16 July 2022 20:55 by user phoronix.

RTX 3070 Ti

Processor: AMD Ryzen 7 5800X3D 8-Core @ 3.40GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (4201 BIOS), Chipset: AMD Starship/Matisse, Memory: 32GB, Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 2000GB Samsung SSD 980 PRO 2TB, Graphics: NVIDIA GeForce RTX 3070 Ti 8GB, Audio: NVIDIA GA104 HD Audio, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211

Graphics Notes: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.5b.00.02
OpenCL Notes: GPU Compute Cores: 6144

Testing initiated at 17 July 2022 06:31 by user phoronix.
