NVIDIA GPU Compute Benchmarks
Benchmarks for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2106127-IB-3080C630035&grs.
ViennaCL
Test: OpenCL BLAS - dGEMM-TT
ViennaCL
Test: OpenCL BLAS - dGEMV-T
ViennaCL
Test: OpenCL BLAS - dGEMV-N
ViennaCL
Test: OpenCL BLAS - sDOT
RedShift Demo
Betsy GPU Compressor
Codec: ETC2 RGB - Quality: Highest
Betsy GPU Compressor
Codec: ETC1 - Quality: Highest
Waifu2x-NCNN Vulkan
Scale: 2x - Denoise: 3 - TAA: Yes
RealSR-NCNN
Scale: 4x - TAA: No
RealSR-NCNN
Scale: 4x - TAA: Yes
VkResample
Upscale: 2x - Precision: Single
Chaos Group V-RAY
Mode: NVIDIA RTX GPU
Chaos Group V-RAY
Mode: NVIDIA CUDA GPU
Blender
Blend File: Barbershop - Compute: NVIDIA OptiX
Blender
Blend File: Barbershop - Compute: CUDA
Blender
Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX
Blender
Blend File: Pabellon Barcelona - Compute: CUDA
Blender
Blend File: Fishy Cat - Compute: NVIDIA OptiX
Blender
Blend File: Fishy Cat - Compute: CUDA
Blender
Blend File: Classroom - Compute: NVIDIA OptiX
Blender
Blend File: Classroom - Compute: CUDA
Blender
Blend File: BMW27 - Compute: NVIDIA OptiX
Blender
Blend File: BMW27 - Compute: CUDA
IndigoBench
Acceleration: OpenCL GPU - Scene: Bedroom
IndigoBench
Acceleration: OpenCL GPU - Scene: Supercar
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: S3D
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: MD5 Hash
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: GEMM SGEMM_N
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: FFT SP
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Texture Read Bandwidth
ViennaCL
Test: OpenCL BLAS - dGEMM-TN
ViennaCL
Test: OpenCL BLAS - dGEMM-NT
ViennaCL
Test: OpenCL BLAS - dGEMM-NN
ViennaCL
Test: OpenCL BLAS - dDOT
ViennaCL
Test: OpenCL BLAS - dAXPY
ViennaCL
Test: OpenCL BLAS - dCOPY
ViennaCL
Test: OpenCL BLAS - sAXPY
ViennaCL
Test: OpenCL BLAS - sCOPY
cl-mem
Benchmark: Copy
cl-mem
Benchmark: Write
cl-mem
Benchmark: Read
LeelaChessZero
Backend: OpenCL
PlaidML
FP16: No - Mode: Inference - Network: VGG19 - Device: OpenCL
PlaidML
FP16: No - Mode: Inference - Network: VGG16 - Device: OpenCL
PlaidML
FP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCL
vkpeak
int16-vec4
vkpeak
int16-scalar
vkpeak
int32-vec4
vkpeak
int32-scalar
vkpeak
fp64-vec4
vkpeak
fp64-scalar
vkpeak
fp16-vec4
vkpeak
fp16-scalar
vkpeak
fp32-vec4
vkpeak
fp32-scalar
clpeak
OpenCL Test: Integer Compute INT
clpeak
OpenCL Test: Double-Precision Double
clpeak
OpenCL Test: Single-Precision Float
clpeak
OpenCL Test: Global Memory Bandwidth
ArrayFire
Test: Conjugate Gradient OpenCL
ArrayFire
Test: BLAS OpenCL
LuxCoreRender
Scene: Danish Mood - Acceleration: GPU
LuxCoreRender
Scene: Orange Juice - Acceleration: GPU
LuxCoreRender
Scene: LuxCore Benchmark - Acceleration: GPU
LuxCoreRender
Scene: Rainbow Colors and Prism - Acceleration: GPU
LuxCoreRender
Scene: DLSC - Acceleration: GPU
OctaneBench
Total Score
NAMD CUDA
ATPase Simulation - 327,506 Atoms
FAHBench
Hashcat
Benchmark: TrueCrypt RIPEMD160 + XTS
Hashcat
Benchmark: 7-Zip
Hashcat
Benchmark: SHA-512
Hashcat
Benchmark: SHA1
Hashcat
Benchmark: MD5
Mixbench
Backend: NVIDIA CUDA - Benchmark: Integer
Mixbench
Backend: NVIDIA CUDA - Benchmark: Half Precision
Mixbench
Backend: NVIDIA CUDA - Benchmark: Double Precision
Mixbench
Backend: NVIDIA CUDA - Benchmark: Single Precision
Phoronix Test Suite v10.8.5