gpu

AMD Ryzen Threadripper PRO 3955WX 16-Cores testing with a LENOVO 1046 (S07KT23A BIOS) and NVIDIA Quadro RTX 4000 8GB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2110272-TJ-GPU16281353

Result Identifier: NVIDIA Quadro RTX 4000
Date Run: October 26 2021
Test Duration: 2 Hours, 44 Minutes


gpu

Processor: AMD Ryzen Threadripper PRO 3955WX 16-Cores @ 3.90GHz (16 Cores / 32 Threads)
Motherboard: LENOVO 1046 (S07KT23A BIOS)
Chipset: AMD Starship/Matisse
Memory: 64GB
Disk: 1024GB SAMSUNG MZVLB1T0HBLR-000L7 + 1000GB Samsung SSD 870
Graphics: NVIDIA Quadro RTX 4000 8GB
Audio: NVIDIA TU104 HD Audio
Network: Aquantia AQC107 NBase-T/IEEE
OS: Ubuntu 20.04
Kernel: 5.10.0-1050-oem (x86_64)
Desktop: GNOME Shell 3.36.9
Display Server: X Server 1.20.11
Display Driver: NVIDIA
OpenGL: 4.6.0
OpenCL: OpenCL 1.2 CUDA 11.2.162
Vulkan: 1.2.155
Compiler: GCC 9.3.0 + CUDA 10.1
File-System: ext4
Screen Resolution: 4720x1440

System Logs

- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x830104d
- BAR1 / Visible vRAM Size: 256 MiB
- Python 3.8.10
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

gpu (results for NVIDIA Quadro RTX 4000)

hashcat: MD5: 25698733333
hashcat: SHA1: 8770800000
hashcat: 7-Zip: 447300
hashcat: SHA-512: 1228433333
hashcat: TrueCrypt RIPEMD160 + XTS: 321367
shoc: OpenCL - S3D: 169.691
shoc: OpenCL - Triad: 12.7192
shoc: OpenCL - FFT SP: 743.724
shoc: OpenCL - MD5 Hash: 17.2560
shoc: OpenCL - Reduction: 309.449
shoc: OpenCL - GEMM SGEMM_N: 2924.79
shoc: OpenCL - Max SP Flops: 8489.48
shoc: OpenCL - Bus Speed Download: 13.0998
shoc: OpenCL - Bus Speed Readback: 13.5383
shoc: OpenCL - Texture Read Bandwidth: 1081.51
cl-mem: Copy: 280.6
cl-mem: Read: 378.2
cl-mem: Write: 310.4
namd-cuda: ATPase Simulation - 327,506 Atoms: 0.15891
octanebench: Total Score: 243.758913
fahbench: 198.405
rodinia: OpenCL Particle Filter: 7.924
financebench: Black-Scholes OpenCL: 22.412
viennacl: CPU BLAS - sCOPY: 84.8
viennacl: CPU BLAS - sAXPY: 127
viennacl: CPU BLAS - sDOT: 134
viennacl: CPU BLAS - dCOPY: 34.0
viennacl: CPU BLAS - dAXPY: 55.0
viennacl: CPU BLAS - dDOT: 64.3
viennacl: CPU BLAS - dGEMV-N: 57.0
viennacl: CPU BLAS - dGEMV-T: 70.4
viennacl: CPU BLAS - dGEMM-NN: 80.9
viennacl: CPU BLAS - dGEMM-NT: 82.9
viennacl: CPU BLAS - dGEMM-TN: 85.5
viennacl: CPU BLAS - dGEMM-TT: 82.8
viennacl: OpenCL BLAS - sCOPY: 257
viennacl: OpenCL BLAS - sAXPY: 315
viennacl: OpenCL BLAS - sDOT: 242
viennacl: OpenCL BLAS - dCOPY: 353
viennacl: OpenCL BLAS - dAXPY: 370
viennacl: OpenCL BLAS - dDOT: 384
viennacl: OpenCL BLAS - dGEMV-N: 357
viennacl: OpenCL BLAS - dGEMV-T: 313
viennacl: OpenCL BLAS - dGEMM-NN: 252
viennacl: OpenCL BLAS - dGEMM-NT: 254
viennacl: OpenCL BLAS - dGEMM-TN: 250
viennacl: OpenCL BLAS - dGEMM-TT: 251
indigobench: OpenCL GPU - Bedroom: 7.152
indigobench: OpenCL GPU - Supercar: 22.764
blender: BMW27 - CUDA: 54.89
blender: Classroom - CUDA: 210.61
blender: Fishy Cat - CUDA: 107.47
blender: BMW27 - NVIDIA OptiX: 32.26
blender: Classroom - NVIDIA OptiX: 109.11
blender: Fishy Cat - NVIDIA OptiX: 66.62
blender: Pabellon Barcelona - CUDA: 410.86
blender: Pabellon Barcelona - NVIDIA OptiX: 151.98
mandelgpu: GPU: 261736699.4
clpeak: Integer Compute INT: 6099.59
clpeak: Single-Precision Float: 6169.71
clpeak: Double-Precision Double: 267.36
clpeak: Global Memory Bandwidth: 343.41
neatbench: GPU: 31.1

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

Hashcat 6.2.4 - Benchmark: MD5 - H/s, More Is Better

NVIDIA Quadro RTX 4000: 25698733333 (SE +/- 135854386.92, N = 3)

Hashcat 6.2.4 - Benchmark: SHA1 - H/s, More Is Better

NVIDIA Quadro RTX 4000: 8770800000 (SE +/- 37758310.34, N = 3)

Hashcat 6.2.4 - Benchmark: 7-Zip - H/s, More Is Better

NVIDIA Quadro RTX 4000: 447300 (SE +/- 721.11, N = 3)

Hashcat 6.2.4 - Benchmark: SHA-512 - H/s, More Is Better

NVIDIA Quadro RTX 4000: 1228433333 (SE +/- 3860195.73, N = 3)

Hashcat 6.2.4 - Benchmark: TrueCrypt RIPEMD160 + XTS - H/s, More Is Better

NVIDIA Quadro RTX 4000: 321367 (SE +/- 433.33, N = 3)
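Each result in this file is a mean accompanied by "SE +/- x, N = n". Assuming SE here is the standard error of the mean (sample standard deviation divided by the square root of N), which is the usual reading but is not spelled out in the result file, the following Python sketch shows how such a figure is derived and how to express it as a percentage of the mean. The three per-run values are hypothetical, because the result file lists only averages; the last lines use the Hashcat MD5 mean and SE reported above.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


# Hypothetical per-run values; individual runs are not part of this result file.
runs = [100.0, 102.0, 104.0]
print("mean:", statistics.mean(runs), "SE:", standard_error(runs))

# Mean and SE as reported for Hashcat MD5 in this result file.
md5_rel = relative_se_percent(25698733333, 135854386.92)
print(f"Hashcat MD5 relative SE: {md5_rel:.2f}%")
```

A relative SE well under one percent, as in the MD5 case, suggests the three runs agreed closely; results with a larger N in this file are likely ones where the run-to-run spread prompted extra runs.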

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

Backend: OpenCL - Benchmark: Integer

NVIDIA Quadro RTX 4000: ./mixbench: 3: ./mixbench-ocl-ro: not found

Backend: NVIDIA CUDA - Benchmark: Integer

NVIDIA Quadro RTX 4000: ./mixbench: 3: ./mixbench-cuda-ro: not found

Backend: OpenCL - Benchmark: Double Precision

NVIDIA Quadro RTX 4000: ./mixbench: 3: ./mixbench-ocl-ro: not found

Backend: OpenCL - Benchmark: Single Precision

NVIDIA Quadro RTX 4000: ./mixbench: 3: ./mixbench-ocl-ro: not found

Backend: NVIDIA CUDA - Benchmark: Half Precision

NVIDIA Quadro RTX 4000: ./mixbench: 3: ./mixbench-cuda-ro: not found

Backend: NVIDIA CUDA - Benchmark: Double Precision

NVIDIA Quadro RTX 4000: ./mixbench: 3: ./mixbench-cuda-ro: not found

Backend: NVIDIA CUDA - Benchmark: Single Precision

NVIDIA Quadro RTX 4000: ./mixbench: 3: ./mixbench-cuda-ro: not found

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: S3D - GFLOPS, More Is Better

NVIDIA Quadro RTX 4000: 169.69 (SE +/- 0.01, N = 3)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Triad - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 12.72 (SE +/- 0.00, N = 3)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: FFT SP - GFLOPS, More Is Better

NVIDIA Quadro RTX 4000: 743.72 (SE +/- 7.49, N = 13)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: MD5 Hash - GHash/s, More Is Better

NVIDIA Quadro RTX 4000: 17.26 (SE +/- 0.00, N = 3)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Reduction - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 309.45 (SE +/- 0.11, N = 3)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: GEMM SGEMM_N - GFLOPS, More Is Better

NVIDIA Quadro RTX 4000: 2924.79 (SE +/- 15.60, N = 3)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Max SP Flops - GFLOPS, More Is Better

NVIDIA Quadro RTX 4000: 8489.48 (SE +/- 49.72, N = 3)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Download - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 13.10 (SE +/- 0.01, N = 3)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Readback - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 13.54 (SE +/- 0.00, N = 3)

SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Texture Read Bandwidth - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 1081.51 (SE +/- 2.22, N = 3)

(CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
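The SHOC Bus Speed Download and Readback results can be put in context by comparing them with the theoretical bandwidth of the host link. The Python sketch below assumes the card is attached over PCIe 3.0 x16 (8 GT/s per lane, 128b/130b encoding) and that SHOC reports decimal GB/s; neither is stated in the result file, so the percentages are only indicative.

```python
def pcie_peak_gb_per_s(gt_per_s=8.0, lanes=16, encoding=128 / 130):
    """Theoretical one-direction PCIe bandwidth in GB/s (1 GB = 1e9 bytes).

    Defaults describe PCIe 3.0 x16, which is an assumption about this system.
    """
    # Each transfer carries one bit per lane; divide by 8 to get bytes.
    return gt_per_s * lanes * encoding / 8.0


peak = pcie_peak_gb_per_s()

# Measured values from the SHOC results in this file (GB/s).
download_eff = 13.10 / peak
readback_eff = 13.54 / peak

print(f"assumed link peak: {peak:.2f} GB/s")
print(f"download: {download_eff:.1%} of peak, readback: {readback_eff:.1%} of peak")
```

Under these assumptions both directions land in the low-to-mid 80 percent range of the link's theoretical rate, which is in the region commonly seen once protocol overhead is taken into account.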

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

cl-mem 2017-01-13 - Benchmark: Copy - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 280.6 (SE +/- 0.09, N = 3)

cl-mem 2017-01-13 - Benchmark: Read - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 378.2 (SE +/- 0.12, N = 3)

cl-mem 2017-01-13 - Benchmark: Write - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 310.4 (SE +/- 1.75, N = 3)

(CC) gcc options: -O2 -flto -lOpenCL

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

NAMD CUDA 2.14 - ATPase Simulation - 327,506 Atoms - days/ns, Fewer Is Better

NVIDIA Quadro RTX 4000: 0.15891 (SE +/- 0.00030, N = 3)
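NAMD reports days/ns, where lower is better, while molecular dynamics throughput is often quoted the other way round as ns/day. The conversion is a simple reciprocal; the Python sketch below applies it to the value reported above. The function name is illustrative only.

```python
def days_per_ns_to_ns_per_day(days_per_ns):
    """Convert a days/ns figure (lower is better) to ns/day (higher is better)."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


# NAMD CUDA ATPase result from this file, in days/ns.
namd_ns_per_day = days_per_ns_to_ns_per_day(0.15891)
print(f"{namd_ns_per_day:.2f} ns/day")
```

This works out to roughly 6.29 ns/day for the ATPase simulation. The figure should not be compared directly with ns/day numbers from other benchmarks in this file, since they simulate different systems.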

OctaneBench

OctaneBench is a test of OctaneRender on the GPU and requires NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

OctaneBench 2020.1 - Total Score - Score, More Is Better

NVIDIA Quadro RTX 4000: 243.76

RedShift Demo

This is a test of MAXON's RedShift demo build that currently requires NVIDIA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

NVIDIA Quadro RTX 4000: The test quit with a non-zero exit status.

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

FAHBench 2.3.2 - Ns Per Day, More Is Better

NVIDIA Quadro RTX 4000: 198.41 (SE +/- 0.38, N = 3)

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenCL Particle Filter - Seconds, Fewer Is Better

NVIDIA Quadro RTX 4000: 7.924 (SE +/- 0.089, N = 3)

(CXX) g++ options: -O2 -lOpenCL

FinanceBench

FinanceBench is a collection of financial program benchmarks that can run on the GPU via OpenCL and on the CPU via OpenMP. The test cases cover the Black-Scholes-Merton process with an analytic European option engine, a QMC (Sobol) Monte Carlo method (equity option example), a fixed-rate bond with a flat forward curve, and a securities repurchase agreement (repo). FinanceBench was originally written by the Cavazos Lab at the University of Delaware. Learn more via the OpenBenchmarking.org test page.

FinanceBench 2016-07-25 - Benchmark: Black-Scholes OpenCL - ms, Fewer Is Better

NVIDIA Quadro RTX 4000: 22.41 (SE +/- 0.19, N = 3)

(CXX) g++ options: -O3 -march=native -fopenmp

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

ViennaCL 1.7.1 - Test: CPU BLAS - sCOPY - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 84.8 (SE +/- 0.26, N = 3)

ViennaCL 1.7.1 - Test: CPU BLAS - sAXPY - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 127

ViennaCL 1.7.1 - Test: CPU BLAS - sDOT - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 134 (SE +/- 0.67, N = 3)

ViennaCL 1.7.1 - Test: CPU BLAS - dCOPY - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 34.0 (SE +/- 3.75, N = 3)

ViennaCL 1.7.1 - Test: CPU BLAS - dAXPY - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 55.0 (SE +/- 7.45, N = 3)

ViennaCL 1.7.1 - Test: CPU BLAS - dDOT - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 64.3 (SE +/- 0.69, N = 3)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMV-N - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 57.0 (SE +/- 7.07, N = 3)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMV-T - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 70.4 (SE +/- 0.86, N = 3)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-NN - GFLOPs/s, More Is Better

NVIDIA Quadro RTX 4000: 80.9 (SE +/- 1.41, N = 3)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-NT - GFLOPs/s, More Is Better

NVIDIA Quadro RTX 4000: 82.9 (SE +/- 0.54, N = 3)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-TN - GFLOPs/s, More Is Better

NVIDIA Quadro RTX 4000: 85.5 (SE +/- 3.30, N = 3)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-TT - GFLOPs/s, More Is Better

NVIDIA Quadro RTX 4000: 82.8 (SE +/- 3.32, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - sCOPY - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 257 (SE +/- 1.86, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - sAXPY - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 315 (SE +/- 0.33, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - sDOT - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 242 (SE +/- 0.67, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dCOPY - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 353

ViennaCL 1.7.1 - Test: OpenCL BLAS - dAXPY - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 370

ViennaCL 1.7.1 - Test: OpenCL BLAS - dDOT - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 384

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMV-N - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 357 (SE +/- 1.45, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMV-T - GB/s, More Is Better

NVIDIA Quadro RTX 4000: 313 (SE +/- 1.00, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-NN - GFLOPs/s, More Is Better

NVIDIA Quadro RTX 4000: 252 (SE +/- 1.76, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-NT - GFLOPs/s, More Is Better

NVIDIA Quadro RTX 4000: 254

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-TN - GFLOPs/s, More Is Better

NVIDIA Quadro RTX 4000: 250 (SE +/- 1.45, N = 3)

ViennaCL 1.7.1 - Test: OpenCL BLAS - dGEMM-TT - GFLOPs/s, More Is Better

NVIDIA Quadro RTX 4000: 251 (SE +/- 1.00, N = 2)

(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
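Because ViennaCL runs the same twelve BLAS routines on the CPU and through OpenCL, the two sets of results can be compared routine by routine. The Python sketch below computes the OpenCL-to-CPU ratio for each routine from the values reported above; the dictionary layout and variable names are illustrative.

```python
# ViennaCL 1.7.1 results from this file (GB/s, or GFLOPs/s for the dGEMM rows).
cpu = {
    "sCOPY": 84.8, "sAXPY": 127, "sDOT": 134,
    "dCOPY": 34.0, "dAXPY": 55.0, "dDOT": 64.3,
    "dGEMV-N": 57.0, "dGEMV-T": 70.4,
    "dGEMM-NN": 80.9, "dGEMM-NT": 82.9, "dGEMM-TN": 85.5, "dGEMM-TT": 82.8,
}
opencl = {
    "sCOPY": 257, "sAXPY": 315, "sDOT": 242,
    "dCOPY": 353, "dAXPY": 370, "dDOT": 384,
    "dGEMV-N": 357, "dGEMV-T": 313,
    "dGEMM-NN": 252, "dGEMM-NT": 254, "dGEMM-TN": 250, "dGEMM-TT": 251,
}

# Ratio > 1 means the OpenCL (GPU) path was faster than the CPU path.
ratios = {name: opencl[name] / cpu[name] for name in cpu}

for name, ratio in ratios.items():
    print(f"{name:9s} {ratio:5.2f}x")
```

Several of the CPU figures (dCOPY, dAXPY, dGEMV-N) carry a large standard error relative to their mean, so the corresponding ratios are less certain than the others.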

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package, tested with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare

NVIDIA Quadro RTX 4000: /cuda-build/run-gromacs: 3: /cuda-build/bin/gmx: not found

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL

NVIDIA Quadro RTX 4000: Test failed to run.

FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL

NVIDIA Quadro RTX 4000: Test failed to run.

FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL

NVIDIA Quadro RTX 4000: Test failed to run.

FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL

NVIDIA Quadro RTX 4000: Test failed to run.

FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL

NVIDIA Quadro RTX 4000: Test failed to run.

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

IndigoBench 4.4 - Acceleration: OpenCL GPU - Scene: Bedroom - M samples/s, More Is Better

NVIDIA Quadro RTX 4000: 7.152 (SE +/- 0.011, N = 3)

IndigoBench 4.4 - Acceleration: OpenCL GPU - Scene: Supercar - M samples/s, More Is Better

NVIDIA Quadro RTX 4000: 22.76 (SE +/- 0.02, N = 3)

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.

Blender 2.92 - Blend File: BMW27 - Compute: CUDA - Seconds, Fewer Is Better

NVIDIA Quadro RTX 4000: 54.89 (SE +/- 0.11, N = 3)

Blender 2.92 - Blend File: Classroom - Compute: CUDA - Seconds, Fewer Is Better

NVIDIA Quadro RTX 4000: 210.61 (SE +/- 0.03, N = 3)

Blender 2.92 - Blend File: Fishy Cat - Compute: CUDA - Seconds, Fewer Is Better

NVIDIA Quadro RTX 4000: 107.47 (SE +/- 0.14, N = 3)

Blend File: Barbershop - Compute: CUDA

NVIDIA Quadro RTX 4000: Error: Cannot read file 'blender-2.92.0-linux64/../barbershop_interior_gpu.blend': No such file or directory

Blender 2.92 - Blend File: BMW27 - Compute: NVIDIA OptiX - Seconds, Fewer Is Better

NVIDIA Quadro RTX 4000: 32.26 (SE +/- 4.11, N = 15)

Blender 2.92 - Blend File: Classroom - Compute: NVIDIA OptiX - Seconds, Fewer Is Better

NVIDIA Quadro RTX 4000: 109.11 (SE +/- 0.15, N = 3)

Blender 2.92 - Blend File: Fishy Cat - Compute: NVIDIA OptiX - Seconds, Fewer Is Better

NVIDIA Quadro RTX 4000: 66.62 (SE +/- 4.05, N = 15)

Blend File: Barbershop - Compute: NVIDIA OptiX

NVIDIA Quadro RTX 4000: Error: Cannot read file 'blender-2.92.0-linux64/../barbershop_interior_gpu.blend': No such file or directory

Blender 2.92 - Blend File: Pabellon Barcelona - Compute: CUDA - Seconds, Fewer Is Better

NVIDIA Quadro RTX 4000: 410.86 (SE +/- 0.21, N = 3)

Blender 2.92 - Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX - Seconds, Fewer Is Better

NVIDIA Quadro RTX 4000: 151.98 (SE +/- 0.11, N = 3)
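Four Blender scenes completed on both the CUDA and OptiX back ends, so the render times above can be turned into per-scene speedups. The Python sketch below divides the CUDA time by the OptiX time for each scene and summarizes the ratios with a geometric mean, a common choice for averaging ratios; the choice of summary statistic is ours, not something the result file prescribes.

```python
import statistics

# Blender 2.92 render times from this file, in seconds (lower is better).
cuda_seconds = {
    "BMW27": 54.89,
    "Classroom": 210.61,
    "Fishy Cat": 107.47,
    "Pabellon Barcelona": 410.86,
}
optix_seconds = {
    "BMW27": 32.26,
    "Classroom": 109.11,
    "Fishy Cat": 66.62,
    "Pabellon Barcelona": 151.98,
}

# Speedup > 1 means OptiX finished the scene faster than CUDA.
speedups = {scene: cuda_seconds[scene] / optix_seconds[scene] for scene in cuda_seconds}
overall = statistics.geometric_mean(speedups.values())

for scene, speedup in speedups.items():
    print(f"{scene:20s} {speedup:.2f}x")
print(f"geometric mean       {overall:.2f}x")
```

The BMW27 and Fishy Cat OptiX results have a comparatively large standard error (N = 15), so their speedups are less precise than those for Classroom and Pabellon Barcelona.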

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

MandelGPU 1.3pts1 - OpenCL Device: GPU - Samples/sec, More Is Better

NVIDIA Quadro RTX 4000: 261736699.4 (SE +/- 2930354.60, N = 3)

(CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak - OpenCL Test: Integer Compute INT - GIOPS, More Is Better

NVIDIA Quadro RTX 4000: 6099.59 (SE +/- 82.36, N = 15)

clpeak - OpenCL Test: Single-Precision Float - GFLOPS, More Is Better

NVIDIA Quadro RTX 4000: 6169.71 (SE +/- 103.56, N = 15)

clpeak - OpenCL Test: Double-Precision Double - GFLOPS, More Is Better

NVIDIA Quadro RTX 4000: 267.36 (SE +/- 1.42, N = 3)

clpeak - OpenCL Test: Global Memory Bandwidth - GBPS, More Is Better

NVIDIA Quadro RTX 4000: 343.41 (SE +/- 4.52, N = 3)

(CXX) g++ options: -O3 -rdynamic -lOpenCL

NeatBench

NeatBench is a benchmark of the cross-platform Neat Video software that runs on the CPU, with optional GPU (OpenCL / CUDA) support. Learn more via the OpenBenchmarking.org test page.

NeatBench 5 - Acceleration: GPU - FPS, More Is Better

NVIDIA Quadro RTX 4000: 31.1 (SE +/- 0.55, N = 15)