NVIDIA GeForce RTX 4090/4080 GPU Linux Compute

GPU check

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2303308-NE-2302139PT36
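
A minimal sketch of how the comparison command above could be launched from a script. The helper names `build_compare_command` and `is_installed` are hypothetical; only the `phoronix-test-suite benchmark <result-id>` invocation and the result identifier come from this page.

```python
import shutil

# Result file identifier quoted in the text above.
RESULT_ID = "2303308-NE-2302139PT36"


def build_compare_command(result_id: str = RESULT_ID) -> list:
    """Return the argv list for the comparison run named in the text."""
    return ["phoronix-test-suite", "benchmark", result_id]


def is_installed() -> bool:
    """Check whether the phoronix-test-suite executable is on PATH."""
    return shutil.which("phoronix-test-suite") is not None


# Typical use (not executed here, since a full run takes a long time):
#   import subprocess
#   if is_installed():
#       subprocess.run(build_compare_command(), check=True)
```

Building the argument list separately from running it keeps the command easy to inspect before starting a benchmark that, per the run durations on this page, can take over an hour.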

Run Management

Result Identifier: Date Run, Test Duration
RTX 2080 SUPER: February 11 2023, 1 Hour, 43 Minutes
RTX 2080 Ti: February 11 2023, 1 Hour, 41 Minutes
TITAN RTX: February 11 2023, 1 Hour, 34 Minutes
RTX 3070: February 12 2023, 1 Hour, 43 Minutes
RTX 3070 Ti: February 12 2023, 1 Hour, 32 Minutes
RTX 3080: February 12 2023, 1 Hour, 28 Minutes
RTX 3080 Ti: February 12 2023, 1 Hour, 24 Minutes
RTX 3090: February 12 2023, 1 Hour, 25 Minutes
RTX 4080: February 12 2023, 1 Hour, 22 Minutes
RTX 4090: February 11 2023, 1 Hour, 19 Minutes
rtx3090: March 29 2023, 48 Minutes
Average test duration: 1 Hour, 27 Minutes


NVIDIA GeForce RTX 4090/4080 GPU Linux Compute

System shared by the RTX 2080 SUPER, RTX 2080 Ti, TITAN RTX, RTX 3070, RTX 3070 Ti, RTX 3080, RTX 3080 Ti, RTX 3090, RTX 4080, and RTX 4090 runs:
Processor: AMD Ryzen 9 7950X 16-Core @ 4.50GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG CROSSHAIR X670E HERO (0805 BIOS)
Chipset: AMD Device 14d8
Memory: 32GB
Disk: Western Digital WD_BLACK SN850X 1000GB + 2000GB
Monitor: ASUS MG28U
Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411
OS: Ubuntu 22.10
Kernel: 6.2.0-060200rc7daily20230206-generic (x86_64)
Desktop: GNOME Shell 43.1
Display Server: X Server 1.21.1.4
Display Driver: NVIDIA 525.89.02
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 12.0.147
Vulkan: 1.3.224
Compiler: GCC 12.2.0 + Clang 15.0.6
File-System: ext4
Screen Resolution: 3840x2160

Graphics and audio device per run (only values listed in the source table are shown):
RTX 2080 SUPER: NVIDIA GeForce RTX 2080 SUPER 8GB, NVIDIA TU104 HD Audio
RTX 2080 Ti: NVIDIA GeForce RTX 2080 Ti 11GB, NVIDIA TU102 HD Audio
TITAN RTX: NVIDIA TITAN RTX 24GB
RTX 3070: NVIDIA GeForce RTX 3070 8GB, NVIDIA GA104 HD Audio
RTX 3070 Ti: NVIDIA GeForce RTX 3070 Ti 8GB
RTX 3080: NVIDIA GeForce RTX 3080 10GB, NVIDIA GA102 HD Audio
RTX 3080 Ti: NVIDIA GeForce RTX 3080 Ti 12GB
RTX 3090: NVIDIA GeForce RTX 3090 24GB
RTX 4080: NVIDIA GeForce RTX 4080 16GB, NVIDIA Device 22bb
RTX 4090: NVIDIA GeForce RTX 4090 24GB, NVIDIA Device 22ba

System for the rtx3090 run:
Processor: AMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads)
Motherboard: Gigabyte TRX40 DESIGNARE (F5 BIOS)
Chipset: AMD Starship/Matisse
Memory: 256GB
Disk: 2048GB ADATA SX8200PNP + 3 x 2048GB SPCC M.2 PCIe SSD + 5 x 14001GB Western Digital WUH721414AL
Graphics: NVIDIA GeForce RTX 3090 24GB
Audio: NVIDIA Device 1aef
Network: 2 x Intel I210 + Intel Wi-Fi 6 AX200
OS: Ubuntu 20.04
Kernel: 5.15.0-67-generic (x86_64)
Display Server: X Server 1.20.11
Display Driver: NVIDIA
OpenCL: OpenCL 3.0 CUDA 11.6.134
Vulkan: 1.3.194
Compiler: GCC 9.4.0 + CUDA 11.6
File-System: btrfs
Screen Resolution: 1024x768

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- RTX 2080 SUPER, RTX 2080 Ti, TITAN RTX, RTX 3070, RTX 3070 Ti, RTX 3080, RTX 3080 Ti, RTX 3090, RTX 4080, RTX 4090 (identical for all ten runs): --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- rtx3090: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-Av3uEd/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- RTX 2080 SUPER, RTX 2080 Ti, TITAN RTX, RTX 3070, RTX 3070 Ti, RTX 3080, RTX 3080 Ti, RTX 3090, RTX 4080, RTX 4090: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa601203
- rtx3090: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301039

Graphics Details
- RTX 2080 SUPER: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.04.79.00.01
- RTX 2080 Ti: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.02.0b.00.0e
- TITAN RTX: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.02.23.00.01
- RTX 3070: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2b
- RTX 3070 Ti: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.5b.00.02
- RTX 3080: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.20.00.07
- RTX 3080 Ti: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.71.00.01
- RTX 3090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.27.00.02
- RTX 4080: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04
- RTX 4090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01
- rtx3090: BAR1 / Visible vRAM Size: 256 MiB

OpenCL Details
- RTX 2080 SUPER: GPU Compute Cores: 3072
- RTX 2080 Ti: GPU Compute Cores: 4352
- TITAN RTX: GPU Compute Cores: 4608
- RTX 3070: GPU Compute Cores: 5888
- RTX 3070 Ti: GPU Compute Cores: 6144
- RTX 3080: GPU Compute Cores: 8704
- RTX 3080 Ti: GPU Compute Cores: 10240
- RTX 3090: GPU Compute Cores: 10496
- RTX 4080: GPU Compute Cores: 9728
- RTX 4090: GPU Compute Cores: 16384

Security Details
- RTX 2080 SUPER, RTX 2080 Ti, TITAN RTX, RTX 3070, RTX 3070 Ti, RTX 3080, RTX 3080 Ti, RTX 3090, RTX 4080, RTX 4090 (identical for all ten runs): itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- rtx3090: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

[Figure: Result Overview (Phoronix Test Suite). Relative performance of RTX 2080 SUPER, RTX 2080 Ti, TITAN RTX, RTX 3070, RTX 3070 Ti, RTX 3080, RTX 3080 Ti, RTX 3090, RTX 4080, RTX 4090, and rtx3090 on an axis from 100% to 519%, across LuxCoreRender, LeelaChessZero, Hashcat, clpeak, FluidX3D, and FAHBench.]

NVIDIA GeForce RTX 4090/4080 GPU Linux Compute

[Results table: one column per result identifier (RTX 2080 SUPER, RTX 2080 Ti, TITAN RTX, RTX 3070, RTX 3070 Ti, RTX 3080, RTX 3080 Ti, RTX 3090, RTX 4080, RTX 4090, rtx3090) and one row per test. The numeric cells were fused together in extraction and are not reproduced here. Tests covered:]
lczero: OpenCL
octanebench: Total Score
blender: Barbershop - NVIDIA CUDA
fahbench
v-ray: NVIDIA RTX GPU
v-ray: NVIDIA CUDA GPU
luxcorerender: DLSC - GPU
luxcorerender: LuxCore Benchmark - GPU
blender: Barbershop - NVIDIA OptiX
blender: Pabellon Barcelona - NVIDIA CUDA
luxcorerender: Orange Juice - GPU
luxcorerender: Danish Mood - GPU
indigobench: OpenCL GPU - Bedroom
indigobench: OpenCL GPU - Supercar
fluidx3d: FP32-FP32
clpeak: Double-Precision Double
blender: Fishy Cat - NVIDIA CUDA
blender: Classroom - NVIDIA CUDA
namd-cuda: ATPase Simulation - 327,506 Atoms
blender: BMW27 - NVIDIA OptiX
fluidx3d: FP32-FP16C
fluidx3d: FP32-FP16S
blender: Pabellon Barcelona - NVIDIA OptiX
blender: Fishy Cat - NVIDIA OptiX
blender: Classroom - NVIDIA OptiX
shoc: OpenCL - Texture Read Bandwidth
blender: BMW27 - NVIDIA CUDA
luxcorerender: Rainbow Colors and Prism - GPU
hashcat: MD5
hashcat: SHA-512
hashcat: SHA1
hashcat: TrueCrypt RIPEMD160 + XTS
hashcat: 7-Zip
shoc: OpenCL - GEMM SGEMM_N
clpeak: Integer Compute INT
shoc: OpenCL - Reduction
shoc: OpenCL - FFT SP
clpeak: Single-Precision Float
shoc: OpenCL - S3D
shoc: OpenCL - MD5 Hash
clpeak: Kernel Latency

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.28, Backend: OpenCL (Nodes Per Second, More Is Better)
RTX 4090: 79274 (SE +/- 369.50, N = 3)
RTX 4080: 76214 (SE +/- 508.16, N = 3)
RTX 3090: 61044 (SE +/- 236.14, N = 3)
RTX 3080 Ti: 58392 (SE +/- 365.35, N = 3)
RTX 3080: 53248 (SE +/- 353.32, N = 3)
RTX 3070 Ti: 42436 (SE +/- 136.52, N = 3)
TITAN RTX: 37600 (SE +/- 155.70, N = 3)
RTX 3070: 36089 (SE +/- 89.51, N = 3)
RTX 2080 Ti: 35017 (SE +/- 245.88, N = 3)
RTX 2080 SUPER: 25293 (SE +/- 74.35, N = 3)
rtx3090: 19898 (SE +/- 209.52, N = 3)
1. (CXX) g++ options: -flto -pthread
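
As an illustration of how a "More Is Better" result like this one can be read, the sketch below normalizes the LeelaChessZero nodes-per-second figures to a chosen baseline. The numbers are copied from the chart above; the function name `relative_to` and the choice of the RTX 2080 SUPER as baseline are assumptions made for the example.

```python
# Nodes per second from the LeelaChessZero 0.28 OpenCL chart above.
NODES_PER_SECOND = {
    "RTX 4090": 79274,
    "RTX 4080": 76214,
    "RTX 3090": 61044,
    "RTX 3080 Ti": 58392,
    "RTX 3080": 53248,
    "RTX 3070 Ti": 42436,
    "TITAN RTX": 37600,
    "RTX 3070": 36089,
    "RTX 2080 Ti": 35017,
    "RTX 2080 SUPER": 25293,
    "rtx3090": 19898,  # separate Threadripper 3990X system, older software stack
}


def relative_to(results, baseline):
    """Return each result divided by the baseline result (higher is faster)."""
    base = results[baseline]
    return {name: value / base for name, value in results.items()}


rel = relative_to(NODES_PER_SECOND, "RTX 2080 SUPER")
for name, ratio in sorted(rel.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:15s} {ratio:.2f}x")
```

The rtx3090 entry comes from a different host, OS, and driver than the other runs, so its ratio reflects the whole system rather than the GPU alone.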

OctaneBench

OctaneBench is a test of the OctaneRender on the GPU and requires the use of NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

OctaneBench 2020.1, Total Score (Score, More Is Better)
RTX 4090: 1335.76
RTX 4080: 961.61
RTX 3090: 674.09
RTX 3080 Ti: 659.90
RTX 3080: 559.22
RTX 3070 Ti: 449.59
RTX 3070: 408.69
TITAN RTX: 379.94
RTX 2080 Ti: 358.03
RTX 2080 SUPER: 273.76
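
One way to put these scores in context is to divide them by the "GPU Compute Cores" counts listed under OpenCL Details. The sketch below does that for a few cards; the per-1000-cores metric and the name `score_per_1000_cores` are illustrative assumptions, not something the result page reports.

```python
# OctaneBench 2020.1 total scores from the chart above.
OCTANE_SCORE = {
    "RTX 4090": 1335.76,
    "RTX 4080": 961.61,
    "RTX 3090": 674.09,
    "RTX 2080 SUPER": 273.76,
}

# "GPU Compute Cores" values from the OpenCL Details on this page.
COMPUTE_CORES = {
    "RTX 4090": 16384,
    "RTX 4080": 9728,
    "RTX 3090": 10496,
    "RTX 2080 SUPER": 3072,
}


def score_per_1000_cores(scores, cores):
    """Return score per 1000 compute cores for every GPU present in both maps."""
    return {
        name: 1000.0 * scores[name] / cores[name]
        for name in scores
        if name in cores
    }


per_k = score_per_1000_cores(OCTANE_SCORE, COMPUTE_CORES)
for name, value in per_k.items():
    print(f"{name:15s} {value:6.2f} points per 1000 cores")
```

Core counts are not directly comparable across GPU architectures or clock speeds, so this ratio is only a rough normalization.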

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

Blender 3.4, Blend File: Barbershop - Compute: NVIDIA CUDA (Seconds, Fewer Is Better)
RTX 4090: 44.81 (SE +/- 0.03, N = 3)
RTX 4080: 61.80 (SE +/- 0.04, N = 3)
RTX 3090: 84.83 (SE +/- 0.09, N = 3)
RTX 3080 Ti: 87.99 (SE +/- 0.08, N = 3)
RTX 3080: 101.34 (SE +/- 0.06, N = 3)
TITAN RTX: 130.62 (SE +/- 0.10, N = 3)
RTX 3070 Ti: 136.94 (SE +/- 0.12, N = 3)
RTX 2080 Ti: 139.50 (SE +/- 0.09, N = 3)
RTX 3070: 146.10 (SE +/- 0.13, N = 3)
RTX 2080 SUPER: 186.19 (SE +/- 0.05, N = 3)

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

FAHBench 2.3.2 (Ns Per Day, More Is Better)
RTX 4090: 444.11 (SE +/- 0.36, N = 3)
RTX 4080: 418.08 (SE +/- 0.20, N = 3)
rtx3090: 332.71 (SE +/- 0.58, N = 3)
RTX 3090: 331.47 (SE +/- 0.67, N = 3)
RTX 3080 Ti: 319.11 (SE +/- 0.14, N = 3)
RTX 3080: 310.09 (SE +/- 0.82, N = 3)
TITAN RTX: 308.78 (SE +/- 0.77, N = 3)
RTX 2080 Ti: 307.62 (SE +/- 1.15, N = 3)
RTX 2080 SUPER: 266.67 (SE +/- 0.46, N = 3)
RTX 3070 Ti: 258.38 (SE +/- 0.11, N = 3)
RTX 3070: 257.88 (SE +/- 0.22, N = 3)
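
The "SE +/- x, N = 3" annotations can be turned into a quick noise estimate. The sketch below assumes SE denotes the standard error of the mean over N runs (s / sqrt(N)); that interpretation, and the helper names, are assumptions rather than something stated on this page. The figures are taken from the FAHBench chart above.

```python
import math

# (mean ns/day, reported SE) pairs from the FAHBench 2.3.2 chart above.
FAHBENCH = {
    "RTX 4090": (444.11, 0.36),
    "RTX 4080": (418.08, 0.20),
    "RTX 3090": (331.47, 0.67),
    "RTX 2080 Ti": (307.62, 1.15),
}
N_RUNS = 3  # "N = 3" in the chart annotations


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


def implied_sample_stdev(se, n):
    """Sample standard deviation implied by SE = s / sqrt(n) (assumed definition)."""
    return se * math.sqrt(n)


for name, (mean, se) in FAHBENCH.items():
    print(
        f"{name:12s} SE = {relative_se_percent(mean, se):.2f}% of mean, "
        f"implied stdev = {implied_sample_stdev(se, N_RUNS):.2f} ns/day"
    )
```

Under that assumption every run listed here has a standard error well below 1% of its mean, so small gaps such as RTX 3070 Ti versus RTX 3070 are the only ones where run-to-run noise is a meaningful fraction of the difference.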

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

Chaos Group V-RAY 5.02, Mode: NVIDIA RTX GPU (vrays, More Is Better)
RTX 4090: 5793 (SE +/- 9.17, N = 3)
RTX 4080: 4204 (SE +/- 15.62, N = 3)
RTX 3090: 2988 (SE +/- 2.08, N = 3)
RTX 3080 Ti: 2894 (SE +/- 4.67, N = 3)
RTX 3080: 2426 (SE +/- 1.45, N = 3)
RTX 3070 Ti: 2009 (SE +/- 1.45, N = 3)
RTX 3070: 1820 (SE +/- 0.33, N = 3)
TITAN RTX: 1427 (SE +/- 2.00, N = 3)
RTX 2080 Ti: 1370 (SE +/- 1.53, N = 3)
RTX 2080 SUPER: 1000 (SE +/- 1.00, N = 3)

Chaos Group V-RAY 5.02, Mode: NVIDIA CUDA GPU (vpaths, More Is Better)
RTX 4090: 4330 (SE +/- 2.52, N = 3)
RTX 4080: 3065 (SE +/- 0.88, N = 3)
RTX 3090: 2103 (SE +/- 4.04, N = 3)
RTX 3080 Ti: 2037 (SE +/- 1.76, N = 3)
RTX 3080: 1769 (SE +/- 1.20, N = 3)
RTX 3070 Ti: 1493 (SE +/- 1.73, N = 3)
RTX 3070: 1411 (SE +/- 1.15, N = 3)
TITAN RTX: 987 (SE +/- 0.58, N = 3)
RTX 2080 Ti: 982 (SE +/- 1.00, N = 3)
RTX 2080 SUPER: 824 (SE +/- 0.33, N = 3)

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender 2.6, Scene: DLSC - Acceleration: GPU (M samples/sec, More Is Better)
rtx3090: 27.93 (SE +/- 0.19, N = 3; MIN: 25.54 / MAX: 28.52)
RTX 4090: 21.54 (SE +/- 0.02, N = 3; MIN: 19.77 / MAX: 21.76)
RTX 4080: 14.94 (SE +/- 0.01, N = 3; MIN: 14.15 / MAX: 15.07)
RTX 3090: 11.41 (SE +/- 0.01, N = 3; MIN: 10.53 / MAX: 11.84)
RTX 3080 Ti: 10.94 (SE +/- 0.00, N = 3; MIN: 10.51 / MAX: 11.13)
RTX 3080: 9.49 (SE +/- 0.01, N = 3; MIN: 9.3 / MAX: 9.77)
RTX 3070 Ti: 6.99 (SE +/- 0.00, N = 3; MIN: 6.45 / MAX: 7.23)
RTX 3070: 6.67 (SE +/- 0.01, N = 3; MIN: 6.52 / MAX: 6.85)
TITAN RTX: 5.78 (SE +/- 0.00, N = 3; MIN: 5.29 / MAX: 6)
RTX 2080 Ti: 5.60 (SE +/- 0.02, N = 3; MIN: 5.47 / MAX: 5.77)
RTX 2080 SUPER: 4.33 (SE +/- 0.01, N = 3; MIN: 4.09 / MAX: 4.5)

LuxCoreRender 2.6, Scene: LuxCore Benchmark - Acceleration: GPU (M samples/sec, More Is Better)
rtx3090: 21.68 (SE +/- 0.08, N = 3; MIN: 9.19 / MAX: 27.36)
RTX 4090: 19.13 (SE +/- 0.01, N = 3; MIN: 6.67 / MAX: 23.01)
RTX 4080: 14.55 (SE +/- 0.01, N = 3; MIN: 4.97 / MAX: 16.87)
RTX 3090: 11.47 (SE +/- 0.01, N = 3; MIN: 4.04 / MAX: 13.09)
RTX 3080 Ti: 11.07 (SE +/- 0.02, N = 3; MIN: 4.03 / MAX: 12.64)
RTX 3080: 9.58 (SE +/- 0.01, N = 3; MIN: 3.17 / MAX: 10.93)
RTX 3070 Ti: 6.93 (SE +/- 0.00, N = 3; MIN: 3.08 / MAX: 7.9)
RTX 3070: 6.63 (SE +/- 0.02, N = 3; MIN: 2.81 / MAX: 7.55)
TITAN RTX: 6.26 (SE +/- 0.01, N = 3; MIN: 2.78 / MAX: 7.17)
RTX 2080 Ti: 5.97 (SE +/- 0.01, N = 3; MIN: 1.91 / MAX: 6.87)
RTX 2080 SUPER: 4.46 (SE +/- 0.01, N = 3; MIN: 1.95 / MAX: 5.11)

Blender


Blender 3.4, Blend File: Barbershop - Compute: NVIDIA OptiX (Seconds, Fewer Is Better)
RTX 4090: 29.52 (SE +/- 0.06, N = 3)
RTX 4080: 37.59 (SE +/- 0.01, N = 3)
RTX 3090: 50.74 (SE +/- 0.02, N = 3)
RTX 3080 Ti: 52.56 (SE +/- 0.07, N = 3)
RTX 3080: 59.95 (SE +/- 0.06, N = 3)
RTX 3070 Ti: 77.27 (SE +/- 0.03, N = 3)
RTX 3070: 82.53 (SE +/- 0.07, N = 3)
TITAN RTX: 84.78 (SE +/- 0.04, N = 3)
RTX 2080 Ti: 89.90 (SE +/- 0.04, N = 3)
RTX 2080 SUPER: 116.06 (SE +/- 0.19, N = 3)
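
Because these are "Fewer Is Better" timings, a speedup is the ratio of the slower time to the faster one. The sketch below compares the Barbershop render times under NVIDIA CUDA and NVIDIA OptiX for each card, using the two Barbershop charts on this page; the function name `optix_speedup` is a hypothetical helper for illustration.

```python
# Barbershop render times in seconds, from the two Blender 3.4 charts on this page.
BARBERSHOP_CUDA = {
    "RTX 4090": 44.81, "RTX 4080": 61.80, "RTX 3090": 84.83,
    "RTX 3080 Ti": 87.99, "RTX 3080": 101.34, "RTX 3070 Ti": 136.94,
    "RTX 3070": 146.10, "TITAN RTX": 130.62, "RTX 2080 Ti": 139.50,
    "RTX 2080 SUPER": 186.19,
}
BARBERSHOP_OPTIX = {
    "RTX 4090": 29.52, "RTX 4080": 37.59, "RTX 3090": 50.74,
    "RTX 3080 Ti": 52.56, "RTX 3080": 59.95, "RTX 3070 Ti": 77.27,
    "RTX 3070": 82.53, "TITAN RTX": 84.78, "RTX 2080 Ti": 89.90,
    "RTX 2080 SUPER": 116.06,
}


def optix_speedup(cuda_seconds, optix_seconds):
    """For time-based results, speedup = CUDA time / OptiX time (>1 means OptiX is faster)."""
    return {gpu: cuda_seconds[gpu] / optix_seconds[gpu] for gpu in cuda_seconds}


speedup = optix_speedup(BARBERSHOP_CUDA, BARBERSHOP_OPTIX)
for gpu, ratio in speedup.items():
    print(f"{gpu:15s} OptiX is {ratio:.2f}x faster than CUDA on Barbershop")
```

Dividing in this direction keeps the convention that a larger number means a faster result, which makes time-based and throughput-based tests easier to read side by side.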

Blender 3.4 - Blend File: Pabellon Barcelona - Compute: NVIDIA CUDA
Seconds, Fewer Is Better

Result          Seconds  SE +/-  N
RTX 4090        20.51    0.01    3
RTX 4080        31.55    0.03    3
RTX 3090        48.61    0.02    3
RTX 3080 Ti     50.72    0.09    3
RTX 3080        59.35    0.07    3
TITAN RTX       74.82    0.05    3
RTX 2080 Ti     79.77    0.08    3
RTX 3070 Ti     81.99    0.03    3
RTX 3070        87.16    0.07    3
RTX 2080 SUPER  112.12   0.09    3

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender 2.6 - Scene: Orange Juice - Acceleration: GPU
M samples/sec, More Is Better

Result          M samples/sec  SE +/-  N  MIN    MAX
rtx3090         22.94          0.27    3  19.59  31.58
RTX 4090        16.94          0.06    3  14.84  23.97
RTX 4080        12.60          0.01    3  10.63  16.95
RTX 3090        10.66          0.04    3  8.55   14.16
RTX 3080 Ti     10.31          0.06    3  8.24   13.67
RTX 3080        9.23           0.01    3  7.29   11.91
RTX 3070 Ti     7.30           0.01    3  5.84   8.87
RTX 3070        7.01           0.00    3  5.62   8.48
TITAN RTX       6.21           0.05    3  4.77   8.28
RTX 2080 Ti     5.95           0.01    3  4.73   7.46
RTX 2080 SUPER  4.90           0.01    3  4.06   5.43

LuxCoreRender 2.6 - Scene: Danish Mood - Acceleration: GPU
M samples/sec, More Is Better

Result          M samples/sec  SE +/-  N  MIN   MAX
rtx3090         19.44          0.02    3  7.75  22.58
RTX 4090        18.22          0.18    3  6.75  21.09
RTX 4080        12.57          0.06    3  4.72  14.53
RTX 3090        9.43           0.09    3  3.4   10.86
RTX 3080 Ti     9.06           0.05    3  3.42  10.44
RTX 3080        7.80           0.01    3  2.99  9.01
RTX 3070 Ti     5.77           0.02    3  2.14  6.67
RTX 3070        5.52           0.04    3  2.12  6.36
TITAN RTX       5.03           0.02    3  1.81  5.76
RTX 2080 Ti     4.80           0.02    3  1.59  5.52
RTX 2080 SUPER  3.76           0.03    3  1.31  4.31

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

IndigoBench 4.4 - Acceleration: OpenCL GPU - Scene: Bedroom
M samples/s, More Is Better

Result          M samples/s  SE +/-  N
RTX 4090        33.395       0.038   3
RTX 4080        24.026       0.012   3
RTX 3090        18.723       0.008   3
RTX 3080 Ti     18.093       0.002   3
RTX 3080        15.682       0.001   3
TITAN RTX       12.303       0.003   3
RTX 3070 Ti     12.196       0.001   3
RTX 2080 Ti     11.585       0.002   3
RTX 3070        11.414       0.011   3
RTX 2080 SUPER  8.407        0.001   3

IndigoBench 4.4 - Acceleration: OpenCL GPU - Scene: Supercar
M samples/s, More Is Better

Result          M samples/s  SE +/-  N
RTX 4090        76.76        0.04    3
RTX 4080        62.78        0.01    3
RTX 3090        49.44        0.03    3
RTX 3080 Ti     47.78        0.05    3
RTX 3080        43.18        0.03    3
RTX 3070 Ti     35.84        0.04    3
TITAN RTX       35.74        0.03    3
RTX 3070        34.36        0.04    3
RTX 2080 Ti     33.84        0.02    3
RTX 2080 SUPER  26.17        0.04    3

FluidX3D

FluidX3D is a fast and memory-efficient lattice Boltzmann CFD (Computational Fluid Dynamics) software package implemented using OpenCL and intended for GPU acceleration. FluidX3D is developed by Moritz Lehmann and is free for non-commercial use. This test profile measures the system's OpenCL performance using the FluidX3D benchmark. Learn more via the OpenBenchmarking.org test page.

FluidX3D 2.3 - Test: FP32-FP32
MLUPs/s, More Is Better

Result          MLUPs/s
RTX 4090        5767
rtx3090         5405
RTX 3090        5386
RTX 3080 Ti     5240
RTX 3080        4362
RTX 4080        3855
RTX 3070 Ti     3532
TITAN RTX       3394
RTX 2080 Ti     3111
RTX 3070        2619
RTX 2080 SUPER  2519

SE +/- values as listed in the source (10 values for 11 results, so the per-result mapping is ambiguous): 1.33, 1.20, 0.67, 0.58, 11.50, 0.00, 0.33, 2.85, 0.33, 0.58; N = 3 for each.

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak - OpenCL Test: Double-Precision Double
GFLOPS, More Is Better

Result          GFLOPS   SE +/-  N
RTX 4090        1417.46  0.06    3
RTX 4080        849.65   0.16    3
RTX 3090        656.03   1.63    3
rtx3090         646.88   1.75    3
RTX 3080 Ti     620.87   1.58    3
TITAN RTX       544.61   1.34    3
RTX 3080        543.71   0.87    3
RTX 2080 Ti     518.57   1.27    3
RTX 2080 SUPER  375.45   0.98    3
RTX 3070 Ti     369.87   0.70    3
RTX 3070        360.78   0.01    3

1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

Blender 3.4 - Blend File: Fishy Cat - Compute: NVIDIA CUDA
Seconds, Fewer Is Better

Result          Seconds  SE +/-  N
RTX 4090        11.68    0.01    4
RTX 4080        16.46    0.03    3
RTX 3090        21.37    0.01    3
RTX 3080 Ti     22.29    0.03    3
RTX 3080        25.25    0.04    3
TITAN RTX       29.39    0.06    3
RTX 2080 Ti     30.83    0.03    3
RTX 3070 Ti     34.27    0.02    3
RTX 3070        35.70    0.03    3
RTX 2080 SUPER  41.00    0.07    3

Blender 3.4 - Blend File: Classroom - Compute: NVIDIA CUDA
Seconds, Fewer Is Better

Result          Seconds  SE +/-  N
RTX 4090        9.71     0.01    5
RTX 4080        13.82    0.01    4
RTX 3090        19.93    0.03    3
RTX 3080 Ti     20.75    0.03    3
RTX 3080        23.99    0.03    3
TITAN RTX       30.42    0.05    3
RTX 2080 Ti     32.24    0.04    3
RTX 3070 Ti     32.43    0.03    3
RTX 3070        34.65    0.03    3
RTX 2080 SUPER  42.82    0.03    3
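Because the Blender results are render times (fewer is better), a relative speedup is the baseline time divided by the candidate time, not the other way around. The following is a minimal sketch of that arithmetic using four of the Classroom / NVIDIA CUDA times reported above; the choice of the RTX 2080 SUPER as baseline and the helper name `speedup` are illustrative assumptions, not part of the Phoronix Test Suite.

```python
# Render times in seconds, copied from the Blender 3.4 Classroom / NVIDIA CUDA result.
render_seconds = {
    "RTX 4090": 9.71,
    "RTX 4080": 13.82,
    "RTX 3090": 19.93,
    "RTX 2080 SUPER": 42.82,
}


def speedup(baseline_seconds: float, candidate_seconds: float) -> float:
    """Return how many times faster the candidate is than the baseline.

    For a "fewer is better" metric the ratio is baseline / candidate.
    """
    if baseline_seconds <= 0 or candidate_seconds <= 0:
        raise ValueError("render times must be positive")
    return baseline_seconds / candidate_seconds


# Hypothetical choice of baseline: the slowest card in this result set.
baseline = "RTX 2080 SUPER"
speedups = {
    gpu: speedup(render_seconds[baseline], seconds)
    for gpu, seconds in render_seconds.items()
}

for gpu, factor in speedups.items():
    print(f"{gpu}: {factor:.2f}x vs {baseline}")
```

For "more is better" metrics such as M samples/sec the ratio would be inverted (candidate divided by baseline).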

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

NAMD CUDA 2.14 - ATPase Simulation - 327,506 Atoms
days/ns, Fewer Is Better

Result          days/ns  SE +/-   N
RTX 4090        0.06811  0.00028  3
RTX 4080        0.07079  0.00068  15
RTX 3080 Ti     0.07847  0.00031  6
RTX 3090        0.07851  0.00065  9
RTX 3080        0.07909  0.00077  6
TITAN RTX       0.09147  0.00061  13
RTX 2080 Ti     0.09403  0.00125  3
RTX 3070 Ti     0.09540  0.00067  6
RTX 3070        0.09655  0.00014  3
RTX 2080 SUPER  0.11231  0.00028  6
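NAMD reports days per simulated nanosecond, so smaller values are faster. Molecular dynamics throughput is often quoted the other way around, as ns/day, which is simply the reciprocal. The sketch below converts a few of the days/ns figures from the result above; the function name `ns_per_day` is a hypothetical helper introduced only for illustration.

```python
# days/ns values copied from the NAMD CUDA 2.14 ATPase result above.
days_per_ns = {
    "RTX 4090": 0.06811,
    "RTX 4080": 0.07079,
    "RTX 3090": 0.07851,
    "RTX 2080 SUPER": 0.11231,
}


def ns_per_day(days_per_ns_value: float) -> float:
    """Invert a days/ns figure into simulated nanoseconds per day."""
    if days_per_ns_value <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns_value


throughput = {gpu: ns_per_day(value) for gpu, value in days_per_ns.items()}

for gpu, value in throughput.items():
    print(f"{gpu}: {value:.2f} ns/day")
```

Note that the ordering flips after conversion: the lowest days/ns figure becomes the highest ns/day figure.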

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

Blender 3.4 - Blend File: BMW27 - Compute: NVIDIA OptiX
Seconds, Fewer Is Better

Result          Seconds  SE +/-  N
RTX 4090        3.35     0.00    9
RTX 4080        4.16     0.05    15
RTX 3090        5.66     0.01    7
RTX 3080 Ti     5.78     0.01    7
RTX 3080        6.62     0.05    15
TITAN RTX       8.08     0.01    6
RTX 3070 Ti     8.12     0.01    6
RTX 2080 Ti     8.54     0.05    15
RTX 3070        8.86     0.05    15
RTX 2080 SUPER  10.86    0.10    7

FluidX3D

FluidX3D is a fast and memory-efficient lattice Boltzmann CFD (Computational Fluid Dynamics) software package implemented using OpenCL and intended for GPU acceleration. FluidX3D is developed by Moritz Lehmann and is free for non-commercial use. This test profile measures the system's OpenCL performance using the FluidX3D benchmark. Learn more via the OpenBenchmarking.org test page.

FluidX3D 2.3 - Test: FP32-FP16C
MLUPs/s, More Is Better

Result          MLUPs/s  SE +/-  N
RTX 4090        11601    0.63    4
RTX 3090        9811     9.21    3
RTX 3080 Ti     9352     10.97   3
rtx3090         9289     17.48   3
RTX 3080        8005     8.67    3
RTX 4080        7930     0.67    3
TITAN RTX       6937     1.67    3
RTX 2080 Ti     6578     1.53    3
RTX 3070 Ti     5977     6.36    3
RTX 2080 SUPER  5213     0.88    3
RTX 3070        5159     3.06    3

FluidX3D 2.3 - Test: FP32-FP16S
MLUPs/s, More Is Better

Result          MLUPs/s  SE +/-  N
RTX 4090        11240    1.89    4
RTX 3090        10678    0.85    4
rtx3090         10543    0.88    3
RTX 3080 Ti     10371    0.67    3
RTX 3080        8507     1.00    3
RTX 4080        7876     6.89    3
TITAN RTX       7095     1.53    3
RTX 3070 Ti     6803     1.33    3
RTX 2080 Ti     6621     1.67    3
RTX 2080 SUPER  5386     0.88    3
RTX 3070        5198     1.00    3

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

Blender 3.4 - Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX
Seconds, Fewer Is Better

Result          Seconds  SE +/-  N
RTX 4090        7.78     0.00    6
RTX 4080        9.82     0.01    5
RTX 3090        15.21    0.01    4
RTX 3080 Ti     15.62    0.01    4
RTX 3080        18.25    0.01    3
RTX 3070 Ti     22.88    0.01    3
TITAN RTX       24.49    0.01    3
RTX 3070        24.87    0.01    3
RTX 2080 Ti     26.34    0.02    3
RTX 2080 SUPER  35.32    0.03    3

Blender 3.4 - Blend File: Fishy Cat - Compute: NVIDIA OptiX
Seconds, Fewer Is Better

Result          Seconds  SE +/-  N
RTX 4090        5.24     0.01    7
RTX 4080        7.33     0.05    15
RTX 3090        9.98     0.00    5
RTX 3080 Ti     10.38    0.01    5
RTX 3080        11.84    0.11    6
TITAN RTX       14.99    0.01    4
RTX 3070 Ti     15.77    0.01    4
RTX 2080 Ti     15.90    0.17    4
RTX 3070        16.68    0.18    4
RTX 2080 SUPER  20.75    0.24    3

Blender 3.4 - Blend File: Classroom - Compute: NVIDIA OptiX
Seconds, Fewer Is Better

Result          Seconds  SE +/-  N
RTX 4090        7.04     0.01    6
RTX 4080        9.13     0.02    5
RTX 3090        13.77    0.00    4
RTX 3080 Ti     14.22    0.01    4
RTX 3080        16.32    0.04    4
TITAN RTX       21.08    0.07    3
RTX 3070 Ti     21.12    0.02    3
RTX 2080 Ti     22.30    0.07    3
RTX 3070        22.99    0.08    3
RTX 2080 SUPER  29.10    0.02    3

SHOC Scalable HeterOgeneous Computing

This is the CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Texture Read Bandwidth
GB/s, More Is Better

Result          GB/s     SE +/-  N
RTX 4090        3020.31  2.83    6
RTX 4080        2972.54  1.64    6
RTX 3090        2228.16  1.47    3
RTX 3080        2201.46  5.24    3
RTX 3080 Ti     2158.34  6.11    3
RTX 3070        2137.73  5.99    4
RTX 3070 Ti     2068.22  3.72    4
RTX 2080 SUPER  1175.55  3.21    3
RTX 2080 Ti     1174.94  3.30    3
TITAN RTX       1158.90  1.25    3

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Target: OpenCL - Benchmark: Texture Read Bandwidth

rtx3090: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

Blender 3.4 - Blend File: BMW27 - Compute: NVIDIA CUDA
Seconds, Fewer Is Better

Result          Seconds  SE +/-  N
RTX 4090        5.39     0.02    7
RTX 4080        7.42     0.01    6
RTX 3090        9.98     0.02    5
RTX 3080 Ti     10.35    0.01    5
RTX 3080        11.82    0.02    4
TITAN RTX       14.07    0.02    4
RTX 2080 Ti     14.75    0.04    4
RTX 3070 Ti     15.72    0.01    4
RTX 3070        16.75    0.01    3
RTX 2080 SUPER  19.32    0.02    3

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender 2.6 - Scene: Rainbow Colors and Prism - Acceleration: GPU
M samples/sec, More Is Better

Result          M samples/sec  SE +/-  N   MIN    MAX
rtx3090         57.43          0.37    3   46.94  66.85
RTX 4090        40.51          0.35    7   36.85  43.48
RTX 4080        33.42          0.09    6   30.02  34.92
RTX 3080 Ti     31.67          0.23    6   28.49  34.26
RTX 3090        31.44          0.24    15  28.69  34.95
RTX 3080        26.53          0.05    6   23.68  28.44
RTX 3070 Ti     21.13          0.09    5   19.76  22.63
RTX 3070        20.98          0.06    5   19.19  22.07
TITAN RTX       16.56          0.11    5   14.55  17.77
RTX 2080 Ti     15.10          0.11    4   14.31  16.35
RTX 2080 SUPER  11.55          0.02    4   9.51   12.4

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Hashcat 6.2.4 - Benchmark: MD5
H/s, More Is Better

Result          H/s           SE +/-         N
RTX 4090        156128571429  174184201.51   7
rtx3090         99587506250   8485707257.57  16
RTX 4080        93723200000   22189456.96    6
RTX 3090        71362683333   71473922.59    6
RTX 3080 Ti     67778783333   76439755.87    6
RTX 3080        60271216667   127337788.10   6
TITAN RTX       58502883333   90205171.38    6
RTX 2080 Ti     55854416667   87506009.00    6
RTX 2080 SUPER  43272383333   57372010.69    6
RTX 3070 Ti     42837716667   65387911.30    6
RTX 3070        41525483333   34079293.97    6

Hashcat 6.2.4 - Benchmark: SHA-512
H/s, More Is Better

Result          H/s         SE +/-       N
RTX 4090        6355000000  5095619.03   6
rtx3090         6067866667  15974806.55  3
RTX 4080        3804083333  353474.81    6
RTX 3090        2824266667  2160812.60   6
RTX 3080 Ti     2684816667  3226702.82   6
RTX 3080        2386133333  825294.56    6
TITAN RTX       2317200000  3557339.83   6
RTX 2080 Ti     2215683333  4024377.11   6
RTX 2080 SUPER  1719683333  969335.40    6
RTX 3070 Ti     1676050000  2275192.30   6
RTX 3070        1641566667  1615480.66   6

Hashcat 6.2.4 - Benchmark: SHA1
H/s, More Is Better

Result          H/s          SE +/-        N
RTX 4090        49971842857  5632425.56    7
rtx3090         42112533333  113872228.59  3
RTX 4080        29871466667  11540238.78   6
RTX 3090        22566533333  20798440.11   6
RTX 3080 Ti     21411783333  27127150.20   6
RTX 3080        19002350000  26246622.00   6
TITAN RTX       18463666667  34235952.51   6
RTX 2080 Ti     17634283333  28508810.53   6
RTX 2080 SUPER  13571266667  2530173.47    6
RTX 3070 Ti     13261466667  7505316.63    6
RTX 3070        13031483333  16362322.91   6

Hashcat 6.2.4 - Benchmark: TrueCrypt RIPEMD160 + XTS
H/s, More Is Better

Result          H/s      SE +/-   N
RTX 4090        1879978  4362.24  9
rtx3090         1580467  3556.37  3
RTX 4080        1115175  9554.03  8
RTX 3090        820300   5517.68  7
RTX 3080 Ti     796125   1888.10  8
RTX 3080        701625   3279.36  8
RTX 2080 Ti     635433   3242.17  9
TITAN RTX       627347   5598.71  15
RTX 3070 Ti     503813   1425.34  8
RTX 2080 SUPER  503488   1162.88  8
RTX 3070        490700   1619.41  8

Hashcat 6.2.4 - Benchmark: 7-Zip
H/s, More Is Better

Result          H/s      SE +/-   N
RTX 4090        2747311  1573.52  9
rtx3090         2145533  3493.01  3
RTX 4080        1693263  798.87   8
RTX 3090        1160144  1016.27  9
RTX 3080 Ti     1115444  593.28   9
RTX 3080        1002178  691.17   9
TITAN RTX       940213   721.96   8
RTX 2080 Ti     890525   618.97   8
RTX 3070 Ti     729067   623.61   9
RTX 2080 SUPER  716656   625.41   9
RTX 3070        689100   814.96   9

SHOC Scalable HeterOgeneous Computing

This is the CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: GEMM SGEMM_N
GFLOPS, More Is Better

Result          GFLOPS    SE +/-  N
RTX 4090        28304.90  123.24  15
RTX 4080        16157.50  171.04  15
RTX 3090        8248.12   9.21    12
RTX 3080 Ti     8184.24   23.31   12
RTX 3080        6454.05   20.26   11
TITAN RTX       5050.99   26.94   15
RTX 2080 Ti     4802.22   4.89    10
RTX 3070 Ti     4717.35   4.57    11
RTX 2080 SUPER  3700.72   1.15    10
RTX 3070        3648.34   12.30   10

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Target: OpenCL - Benchmark: GEMM SGEMM_N

rtx3090: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak - OpenCL Test: Integer Compute INT
GIOPS, More Is Better

Result          GIOPS     SE +/-  N
RTX 4090        41543.66  173.91  13
RTX 4080        24329.78  60.93   13
RTX 3090        18108.44  123.02  12
rtx3090         18081.93  192.00  3
RTX 3080 Ti     17294.44  110.25  12
RTX 3080        15456.53  103.33  15
TITAN RTX       13843.96  147.60  15
RTX 2080 Ti     12958.38  114.69  15
RTX 3070 Ti     10912.89  37.91   13
RTX 2080 SUPER  10370.08  81.62   15
RTX 3070        10240.62  57.96   13

1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

This is the CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Reduction
GB/s, More Is Better

Result          GB/s     SE +/-  N
RTX 4090        1012.09  3.20    15
RTX 4080        612.65   0.19    13
RTX 3090        396.22   0.23    13
RTX 3080        393.90   0.07    13
RTX 3080 Ti     390.10   0.04    13
RTX 3070 Ti     383.88   0.11    13
RTX 2080 Ti     380.03   0.08    13
TITAN RTX       370.28   0.15    13
RTX 2080 SUPER  342.28   0.09    12
RTX 3070        329.10   0.28    12

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Target: OpenCL - Benchmark: Reduction

rtx3090: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: FFT SP
GFLOPS, More Is Better

Result          GFLOPS   SE +/-  N
RTX 4090        2794.91  1.30    13
RTX 3090        2103.35  0.18    13
RTX 3080 Ti     2016.91  0.15    13
RTX 4080        1816.02  1.29    13
RTX 3080        1763.04  0.13    13
TITAN RTX       1560.81  1.10    13
RTX 2080 Ti     1474.31  0.74    13
RTX 3070 Ti     1292.32  0.04    13
RTX 2080 SUPER  1196.14  0.66    13
RTX 3070        1139.30  0.25    13

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Target: OpenCL - Benchmark: FFT SP

rtx3090: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak - OpenCL Test: Single-Precision Float
GFLOPS, More Is Better

Result          GFLOPS    SE +/-  N
RTX 4090        81720.00  6.84    15
RTX 4080        48185.06  23.64   15
rtx3090         35317.79  90.58   3
RTX 3090        35152.14  29.26   14
RTX 3080 Ti     33745.99  24.64   15
RTX 3080        29451.31  8.73    13
RTX 3070 Ti     21506.02  10.70   15
RTX 3070        20086.22  3.16    14
TITAN RTX       14568.40  168.44  15
RTX 2080 Ti     13451.55  183.59  15
RTX 2080 SUPER  10385.51  70.04   15

1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

This is the CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: S3D
GFLOPS, More Is Better

Result          GFLOPS  SE +/-  N
RTX 4090        648.56  0.14    13
RTX 3090        429.20  0.36    14
RTX 4080        426.94  0.23    14
RTX 3080 Ti     418.48  0.11    14
RTX 3080        340.55  0.05    14
TITAN RTX       288.95  0.08    13
RTX 3070 Ti     286.35  0.06    13
RTX 2080 Ti     271.62  0.13    13
RTX 3070        219.70  0.09    13
RTX 2080 SUPER  212.55  0.12    13

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Target: OpenCL - Benchmark: S3D

rtx3090: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: MD5 Hash
GHash/s, More Is Better

Result          GHash/s  SE +/-  N
RTX 4090        93.12    0.94    15
RTX 4080        59.30    0.01    15
RTX 3090        44.12    0.00    14
RTX 3080 Ti     41.81    0.04    14
RTX 3080        37.47    0.02    15
TITAN RTX       36.52    0.04    14
RTX 2080 Ti     34.74    0.03    14
RTX 2080 SUPER  26.40    0.02    13
RTX 3070 Ti     26.10    0.02    14
RTX 3070        25.52    0.01    13

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Target: OpenCL - Benchmark: MD5 Hash

rtx3090: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./shoc: 3: ./bin/shocdriver: not found

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak - OpenCL Test: Kernel Latency
us, Fewer Is Better

Result          us    SE +/-  N
RTX 4080        3.68  0.00    15
RTX 4090        3.73  0.01    15
RTX 3070 Ti     3.97  0.01    15
RTX 3070        4.02  0.01    15
RTX 3080        4.02  0.01    15
RTX 3080 Ti     4.08  0.01    15
RTX 3090        4.13  0.01    15
RTX 2080 SUPER  4.19  0.01    15
TITAN RTX       4.43  0.01    15
RTX 2080 Ti     4.47  0.01    15
rtx3090         6.12  0.00    3

1. (CXX) g++ options: -O3 -rdynamic -lOpenCL

GPU Temperature Monitor

GPU Temperature Monitor - Phoronix Test Suite System Monitoring
Celsius

Result          Min  Avg    Max
RTX 4080        23   45.77  61
RTX 4090        31   47.84  66
RTX 3070        25   59.42  75
RTX 2080 SUPER  29   60.65  77
RTX 3090        24   61.23  71
RTX 3070 Ti     29   66.31  80
RTX 3080        25   66.91  80
RTX 3080 Ti     28   67.74  80
RTX 2080 Ti     34   67.84  84
TITAN RTX       37   68.17  83

GPU Power Consumption Monitor

GPU Power Consumption Monitor - Phoronix Test Suite System Monitoring
Watts

Result          Min    Avg     Max
RTX 3070        9.55   124.92  219.88
RTX 2080 SUPER  9.54   128.61  255.9
RTX 4080        8.71   129.34  305.29
RTX 4090        8.53   153.59  447.97
RTX 2080 Ti     6.2    163.81  270.77
RTX 3070 Ti     13.25  169.39  290.91
TITAN RTX       8.25   179.55  358.29
RTX 3080        11.12  209.84  321.68
RTX 3080 Ti     19.24  235.21  353.12
RTX 3090        11.8   236.01  350.22