gpu AMD Ryzen Threadripper PRO 3955WX 16-Cores testing with a LENOVO 1046 (S07KT23A BIOS) and NVIDIA Quadro RTX 4000 8GB on Ubuntu 20.04 via the Phoronix Test Suite. NVIDIA Quadro RTX 4000: Processor: AMD Ryzen Threadripper PRO 3955WX 16-Cores @ 3.90GHz (16 Cores / 32 Threads), Motherboard: LENOVO 1046 (S07KT23A BIOS), Chipset: AMD Starship/Matisse, Memory: 64GB, Disk: 1024GB SAMSUNG MZVLB1T0HBLR-000L7 + 1000GB Samsung SSD 870, Graphics: NVIDIA Quadro RTX 4000 8GB, Audio: NVIDIA TU104 HD Audio, Network: Aquantia AQC107 NBase-T/IEEE OS: Ubuntu 20.04, Kernel: 5.10.0-1050-oem (x86_64), Desktop: GNOME Shell 3.36.9, Display Server: X Server 1.20.11, Display Driver: NVIDIA, OpenGL: 4.6.0, OpenCL: OpenCL 1.2 CUDA 11.2.162, Vulkan: 1.2.155, Compiler: GCC 9.3.0 + CUDA 10.1, File-System: ext4, Screen Resolution: 4720x1440 Hashcat 6.2.4 Benchmark: MD5 H/s > Higher Is Better NVIDIA Quadro RTX 4000 . 25698733333 |========================================= Hashcat 6.2.4 Benchmark: SHA1 H/s > Higher Is Better NVIDIA Quadro RTX 4000 . 8770800000 |========================================== Hashcat 6.2.4 Benchmark: 7-Zip H/s > Higher Is Better NVIDIA Quadro RTX 4000 . 447300 |============================================== Hashcat 6.2.4 Benchmark: SHA-512 H/s > Higher Is Better NVIDIA Quadro RTX 4000 . 1228433333 |========================================== Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better NVIDIA Quadro RTX 4000 . 321367 |============================================== Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Integer Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Half Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Double Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Single Precision SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D GFLOPS > Higher Is Better NVIDIA Quadro RTX 4000 . 169.69 |============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 12.72 |=============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP GFLOPS > Higher Is Better NVIDIA Quadro RTX 4000 . 743.72 |============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash GHash/s > Higher Is Better NVIDIA Quadro RTX 4000 . 17.26 |=============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 309.45 |============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N GFLOPS > Higher Is Better NVIDIA Quadro RTX 4000 . 2924.79 |============================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops GFLOPS > Higher Is Better NVIDIA Quadro RTX 4000 . 8489.48 |============================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 13.10 |=============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 13.54 |=============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 1081.51 |============================================= cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 280.6 |=============================================== cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 378.2 |=============================================== cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 310.4 |=============================================== NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms days/ns < Lower Is Better NVIDIA Quadro RTX 4000 . 0.15891 |============================================= OctaneBench 2020.1 Total Score Score > Higher Is Better NVIDIA Quadro RTX 4000 . 243.76 |============================================== RedShift Demo 3.0 Seconds < Lower Is Better FAHBench 2.3.2 Ns Per Day > Higher Is Better NVIDIA Quadro RTX 4000 . 198.41 |============================================== Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better NVIDIA Quadro RTX 4000 . 7.924 |=============================================== FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better NVIDIA Quadro RTX 4000 . 22.41 |=============================================== ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 84.8 |================================================ ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 127 |================================================= ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 134 |================================================= ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 34.0 |================================================ ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 55.0 |================================================ ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 64.3 |================================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 57.0 |================================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 70.4 |================================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better NVIDIA Quadro RTX 4000 . 80.9 |================================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better NVIDIA Quadro RTX 4000 . 82.9 |================================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better NVIDIA Quadro RTX 4000 . 85.5 |================================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better NVIDIA Quadro RTX 4000 . 82.8 |================================================ ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 257 |================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 315 |================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 242 |================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 353 |================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 370 |================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 384 |================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 357 |================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better NVIDIA Quadro RTX 4000 . 313 |================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better NVIDIA Quadro RTX 4000 . 252 |================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GFLOPs/s > Higher Is Better NVIDIA Quadro RTX 4000 . 254 |================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better NVIDIA Quadro RTX 4000 . 250 |================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT GFLOPs/s > Higher Is Better NVIDIA Quadro RTX 4000 . 251 |================================================= GROMACS 2021.2 Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL Examples Per Second > Higher Is Better IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Bedroom M samples/s > Higher Is Better NVIDIA Quadro RTX 4000 . 7.152 |=============================================== IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Supercar M samples/s > Higher Is Better NVIDIA Quadro RTX 4000 . 22.76 |=============================================== Blender 2.92 Blend File: BMW27 - Compute: CUDA Seconds < Lower Is Better NVIDIA Quadro RTX 4000 . 54.89 |=============================================== Blender 2.92 Blend File: Classroom - Compute: CUDA Seconds < Lower Is Better NVIDIA Quadro RTX 4000 . 210.61 |============================================== Blender 2.92 Blend File: Fishy Cat - Compute: CUDA Seconds < Lower Is Better NVIDIA Quadro RTX 4000 . 107.47 |============================================== Blender 2.92 Blend File: Barbershop - Compute: CUDA Seconds < Lower Is Better Blender 2.92 Blend File: BMW27 - Compute: NVIDIA OptiX Seconds < Lower Is Better NVIDIA Quadro RTX 4000 . 32.26 |=============================================== Blender 2.92 Blend File: Classroom - Compute: NVIDIA OptiX Seconds < Lower Is Better NVIDIA Quadro RTX 4000 . 109.11 |============================================== Blender 2.92 Blend File: Fishy Cat - Compute: NVIDIA OptiX Seconds < Lower Is Better NVIDIA Quadro RTX 4000 . 66.62 |=============================================== Blender 2.92 Blend File: Barbershop - Compute: NVIDIA OptiX Seconds < Lower Is Better Blender 2.92 Blend File: Pabellon Barcelona - Compute: CUDA Seconds < Lower Is Better NVIDIA Quadro RTX 4000 . 410.86 |============================================== Blender 2.92 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Seconds < Lower Is Better NVIDIA Quadro RTX 4000 . 151.98 |============================================== MandelGPU 1.3pts1 OpenCL Device: GPU Samples/sec > Higher Is Better NVIDIA Quadro RTX 4000 . 261736699.4 |========================================= clpeak OpenCL Test: Integer Compute INT GIOPS > Higher Is Better NVIDIA Quadro RTX 4000 . 6099.59 |============================================= clpeak OpenCL Test: Single-Precision Float GFLOPS > Higher Is Better NVIDIA Quadro RTX 4000 . 6169.71 |============================================= clpeak OpenCL Test: Double-Precision Double GFLOPS > Higher Is Better NVIDIA Quadro RTX 4000 . 267.36 |============================================== clpeak OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better NVIDIA Quadro RTX 4000 . 343.41 |============================================== NeatBench 5 Acceleration: GPU FPS > Higher Is Better NVIDIA Quadro RTX 4000 . 31.1 |================================================