gpu AMD Ryzen Threadripper PRO 3955WX 16-Cores testing with a LENOVO 1046 (S07KT23A BIOS) and NVIDIA Quadro RTX 4000 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2110272-TJ-GPU16281353 .
gpu Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution NVIDIA Quadro RTX 4000 AMD Ryzen Threadripper PRO 3955WX 16-Cores @ 3.90GHz (16 Cores / 32 Threads) LENOVO 1046 (S07KT23A BIOS) AMD Starship/Matisse 64GB 1024GB SAMSUNG MZVLB1T0HBLR-000L7 + 1000GB Samsung SSD 870 NVIDIA Quadro RTX 4000 8GB NVIDIA TU104 HD Audio Aquantia AQC107 NBase-T/IEEE Ubuntu 20.04 5.10.0-1050-oem (x86_64) GNOME Shell 3.36.9 X Server 1.20.11 NVIDIA 4.6.0 OpenCL 1.2 CUDA 11.2.162 1.2.155 GCC 9.3.0 + CUDA 10.1 ext4 4720x1440 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x830104d - BAR1 / Visible vRAM Size: 256 MiB - Python 3.8.10 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
gpu hashcat: MD5 hashcat: SHA1 hashcat: 7-Zip hashcat: SHA-512 hashcat: TrueCrypt RIPEMD160 + XTS shoc: OpenCL - S3D shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Reduction shoc: OpenCL - GEMM SGEMM_N shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth cl-mem: Copy cl-mem: Read cl-mem: Write namd-cuda: ATPase Simulation - 327,506 Atoms octanebench: Total Score fahbench: rodinia: OpenCL Particle Filter financebench: Black-Scholes OpenCL viennacl: CPU BLAS - sCOPY viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-TT viennacl: OpenCL BLAS - sCOPY viennacl: OpenCL BLAS - sAXPY viennacl: OpenCL BLAS - sDOT viennacl: OpenCL BLAS - dCOPY viennacl: OpenCL BLAS - dAXPY viennacl: OpenCL BLAS - dDOT viennacl: OpenCL BLAS - dGEMV-N viennacl: OpenCL BLAS - dGEMV-T viennacl: OpenCL BLAS - dGEMM-NN viennacl: OpenCL BLAS - dGEMM-NT viennacl: OpenCL BLAS - dGEMM-TN viennacl: OpenCL BLAS - dGEMM-TT indigobench: OpenCL GPU - Bedroom indigobench: OpenCL GPU - Supercar blender: BMW27 - CUDA blender: Classroom - CUDA blender: Fishy Cat - CUDA blender: BMW27 - NVIDIA OptiX blender: Classroom - NVIDIA OptiX blender: Fishy Cat - NVIDIA OptiX blender: Pabellon Barcelona - CUDA blender: Pabellon Barcelona - NVIDIA OptiX mandelgpu: GPU clpeak: Integer Compute INT clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth neatbench: GPU NVIDIA Quadro RTX 4000 25698733333 8770800000 447300 1228433333 321367 169.691 12.7192 743.724 17.2560 309.449 2924.79 8489.48 13.0998 13.5383 1081.51 280.6 378.2 310.4 0.15891 243.758913 198.4057 7.924 22.412 84.8 127 134 34.0 55.0 64.3 57.0 70.4 80.9 82.9 85.5 82.8 257 315 242 353 370 384 357 313 252 254 250 251 7.152 22.764 54.89 210.61 107.47 32.26 109.11 66.62 410.86 151.98 261736699.4 6099.59 6169.71 267.36 343.41 31.1 OpenBenchmarking.org
Hashcat Benchmark: MD5 OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: MD5 NVIDIA Quadro RTX 4000 6000M 12000M 18000M 24000M 30000M SE +/- 135854386.92, N = 3 25698733333
Hashcat Benchmark: SHA1 OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: SHA1 NVIDIA Quadro RTX 4000 2000M 4000M 6000M 8000M 10000M SE +/- 37758310.34, N = 3 8770800000
Hashcat Benchmark: 7-Zip OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: 7-Zip NVIDIA Quadro RTX 4000 100K 200K 300K 400K 500K SE +/- 721.11, N = 3 447300
Hashcat Benchmark: SHA-512 OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: SHA-512 NVIDIA Quadro RTX 4000 300M 600M 900M 1200M 1500M SE +/- 3860195.73, N = 3 1228433333
Hashcat Benchmark: TrueCrypt RIPEMD160 + XTS OpenBenchmarking.org H/s, More Is Better Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS NVIDIA Quadro RTX 4000 70K 140K 210K 280K 350K SE +/- 433.33, N = 3 321367
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: S3D OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D NVIDIA Quadro RTX 4000 40 80 120 160 200 SE +/- 0.01, N = 3 169.69 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad NVIDIA Quadro RTX 4000 3 6 9 12 15 SE +/- 0.00, N = 3 12.72 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP NVIDIA Quadro RTX 4000 160 320 480 640 800 SE +/- 7.49, N = 13 743.72 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash NVIDIA Quadro RTX 4000 4 8 12 16 20 SE +/- 0.00, N = 3 17.26 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Reduction OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction NVIDIA Quadro RTX 4000 70 140 210 280 350 SE +/- 0.11, N = 3 309.45 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: GEMM SGEMM_N OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N NVIDIA Quadro RTX 4000 600 1200 1800 2400 3000 SE +/- 15.60, N = 3 2924.79 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops NVIDIA Quadro RTX 4000 2K 4K 6K 8K 10K SE +/- 49.72, N = 3 8489.48 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download NVIDIA Quadro RTX 4000 3 6 9 12 15 SE +/- 0.01, N = 3 13.10 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback NVIDIA Quadro RTX 4000 3 6 9 12 15 SE +/- 0.00, N = 3 13.54 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth NVIDIA Quadro RTX 4000 200 400 600 800 1000 SE +/- 2.22, N = 3 1081.51 1. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy NVIDIA Quadro RTX 4000 60 120 180 240 300 SE +/- 0.09, N = 3 280.6 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read NVIDIA Quadro RTX 4000 80 160 240 320 400 SE +/- 0.12, N = 3 378.2 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write NVIDIA Quadro RTX 4000 70 140 210 280 350 SE +/- 1.75, N = 3 310.4 1. (CC) gcc options: -O2 -flto -lOpenCL
NAMD CUDA ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms NVIDIA Quadro RTX 4000 0.0358 0.0716 0.1074 0.1432 0.179 SE +/- 0.00030, N = 3 0.15891
OctaneBench Total Score OpenBenchmarking.org Score, More Is Better OctaneBench 2020.1 Total Score NVIDIA Quadro RTX 4000 50 100 150 200 250 243.76
FAHBench OpenBenchmarking.org Ns Per Day, More Is Better FAHBench 2.3.2 NVIDIA Quadro RTX 4000 40 80 120 160 200 SE +/- 0.38, N = 3 198.41
Rodinia Test: OpenCL Particle Filter OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Particle Filter NVIDIA Quadro RTX 4000 2 4 6 8 10 SE +/- 0.089, N = 3 7.924 1. (CXX) g++ options: -O2 -lOpenCL
FinanceBench Benchmark: Black-Scholes OpenCL OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL NVIDIA Quadro RTX 4000 5 10 15 20 25 SE +/- 0.19, N = 3 22.41 1. (CXX) g++ options: -O3 -march=native -fopenmp
ViennaCL Test: CPU BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY NVIDIA Quadro RTX 4000 20 40 60 80 100 SE +/- 0.26, N = 3 84.8 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY NVIDIA Quadro RTX 4000 30 60 90 120 150 127 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT NVIDIA Quadro RTX 4000 30 60 90 120 150 SE +/- 0.67, N = 3 134 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY NVIDIA Quadro RTX 4000 8 16 24 32 40 SE +/- 3.75, N = 3 34.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY NVIDIA Quadro RTX 4000 12 24 36 48 60 SE +/- 7.45, N = 3 55.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT NVIDIA Quadro RTX 4000 14 28 42 56 70 SE +/- 0.69, N = 3 64.3 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N NVIDIA Quadro RTX 4000 13 26 39 52 65 SE +/- 7.07, N = 3 57.0 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T NVIDIA Quadro RTX 4000 16 32 48 64 80 SE +/- 0.86, N = 3 70.4 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN NVIDIA Quadro RTX 4000 20 40 60 80 100 SE +/- 1.41, N = 3 80.9 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT NVIDIA Quadro RTX 4000 20 40 60 80 100 SE +/- 0.54, N = 3 82.9 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN NVIDIA Quadro RTX 4000 20 40 60 80 100 SE +/- 3.30, N = 3 85.5 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT NVIDIA Quadro RTX 4000 20 40 60 80 100 SE +/- 3.32, N = 3 82.8 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY NVIDIA Quadro RTX 4000 60 120 180 240 300 SE +/- 1.86, N = 3 257 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY NVIDIA Quadro RTX 4000 70 140 210 280 350 SE +/- 0.33, N = 3 315 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT NVIDIA Quadro RTX 4000 50 100 150 200 250 SE +/- 0.67, N = 3 242 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY NVIDIA Quadro RTX 4000 80 160 240 320 400 353 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY NVIDIA Quadro RTX 4000 80 160 240 320 400 370 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT NVIDIA Quadro RTX 4000 80 160 240 320 400 384 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N NVIDIA Quadro RTX 4000 80 160 240 320 400 SE +/- 1.45, N = 3 357 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T NVIDIA Quadro RTX 4000 70 140 210 280 350 SE +/- 1.00, N = 3 313 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN NVIDIA Quadro RTX 4000 60 120 180 240 300 SE +/- 1.76, N = 3 252 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT NVIDIA Quadro RTX 4000 60 120 180 240 300 254 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN NVIDIA Quadro RTX 4000 50 100 150 200 250 SE +/- 1.45, N = 3 250 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
ViennaCL Test: OpenCL BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT NVIDIA Quadro RTX 4000 50 100 150 200 250 SE +/- 1.00, N = 2 251 1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
IndigoBench Acceleration: OpenCL GPU - Scene: Bedroom OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Bedroom NVIDIA Quadro RTX 4000 2 4 6 8 10 SE +/- 0.011, N = 3 7.152
IndigoBench Acceleration: OpenCL GPU - Scene: Supercar OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Supercar NVIDIA Quadro RTX 4000 5 10 15 20 25 SE +/- 0.02, N = 3 22.76
Blender Blend File: BMW27 - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: BMW27 - Compute: CUDA NVIDIA Quadro RTX 4000 12 24 36 48 60 SE +/- 0.11, N = 3 54.89
Blender Blend File: Classroom - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Classroom - Compute: CUDA NVIDIA Quadro RTX 4000 50 100 150 200 250 SE +/- 0.03, N = 3 210.61
Blender Blend File: Fishy Cat - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Fishy Cat - Compute: CUDA NVIDIA Quadro RTX 4000 20 40 60 80 100 SE +/- 0.14, N = 3 107.47
Blender Blend File: BMW27 - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: BMW27 - Compute: NVIDIA OptiX NVIDIA Quadro RTX 4000 7 14 21 28 35 SE +/- 4.11, N = 15 32.26
Blender Blend File: Classroom - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Classroom - Compute: NVIDIA OptiX NVIDIA Quadro RTX 4000 20 40 60 80 100 SE +/- 0.15, N = 3 109.11
Blender Blend File: Fishy Cat - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Fishy Cat - Compute: NVIDIA OptiX NVIDIA Quadro RTX 4000 15 30 45 60 75 SE +/- 4.05, N = 15 66.62
Blender Blend File: Pabellon Barcelona - Compute: CUDA OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Pabellon Barcelona - Compute: CUDA NVIDIA Quadro RTX 4000 90 180 270 360 450 SE +/- 0.21, N = 3 410.86
Blender Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX NVIDIA Quadro RTX 4000 30 60 90 120 150 SE +/- 0.11, N = 3 151.98
MandelGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelGPU 1.3pts1 OpenCL Device: GPU NVIDIA Quadro RTX 4000 60M 120M 180M 240M 300M SE +/- 2930354.60, N = 3 261736699.4 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT NVIDIA Quadro RTX 4000 1300 2600 3900 5200 6500 SE +/- 82.36, N = 15 6099.59 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float NVIDIA Quadro RTX 4000 1300 2600 3900 5200 6500 SE +/- 103.56, N = 15 6169.71 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double NVIDIA Quadro RTX 4000 60 120 180 240 300 SE +/- 1.42, N = 3 267.36 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth NVIDIA Quadro RTX 4000 70 140 210 280 350 SE +/- 4.52, N = 3 343.41 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
NeatBench Acceleration: GPU OpenBenchmarking.org FPS, More Is Better NeatBench 5 Acceleration: GPU NVIDIA Quadro RTX 4000 7 14 21 28 35 SE +/- 0.55, N = 15 31.1
Phoronix Test Suite v10.8.4