NVIDIA GPU Compute Benchmarks

Benchmarks for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2106127-IB-3080C630035
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
RTX 3080 RBAR
June 12 2021
  3 Hours, 54 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA GPU Compute Benchmarks Suite 1.0.0 System Test suite extracted from NVIDIA GPU Compute Benchmarks. pts/plaidml-1.0.4 --no-fp16 --no-train resnet50 OPENCL FP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCL pts/plaidml-1.0.4 --no-fp16 --no-train vgg16 OPENCL FP16: No - Mode: Inference - Network: VGG16 - Device: OpenCL pts/plaidml-1.0.4 --no-fp16 --no-train vgg19 OPENCL FP16: No - Mode: Inference - Network: VGG19 - Device: OpenCL pts/cl-mem-1.0.1 READ Benchmark: Read pts/cl-mem-1.0.1 WRITE Benchmark: Write pts/cl-mem-1.0.1 COPY Benchmark: Copy pts/viennacl-1.1.0 dense_blas-bench-opencl Test: OpenCL BLAS - sCOPY pts/viennacl-1.1.0 dense_blas-bench-opencl Test: OpenCL BLAS - sAXPY pts/viennacl-1.1.0 dense_blas-bench-opencl Test: OpenCL BLAS - dCOPY pts/viennacl-1.1.0 dense_blas-bench-opencl Test: OpenCL BLAS - dAXPY pts/viennacl-1.1.0 dense_blas-bench-opencl Test: OpenCL BLAS - dDOT pts/shoc-1.2.0 -opencl -benchmark DeviceMemory Target: OpenCL - Benchmark: Texture Read Bandwidth pts/viennacl-1.1.0 dense_blas-bench-opencl Test: OpenCL BLAS - sDOT pts/viennacl-1.1.0 dense_blas-bench-opencl Test: OpenCL BLAS - dGEMV-N pts/viennacl-1.1.0 dense_blas-bench-opencl Test: OpenCL BLAS - dGEMV-T pts/clpeak-1.0.1 --global-bandwidth OpenCL Test: Global Memory Bandwidth pts/mixbench-1.1.1 mixbench-cuda-ro SPGFLOPS Backend: NVIDIA CUDA - Benchmark: Single Precision pts/mixbench-1.1.1 mixbench-cuda-ro DPGFLOPS Backend: NVIDIA CUDA - Benchmark: Double Precision pts/mixbench-1.1.1 mixbench-cuda-ro HPGFLOPS Backend: NVIDIA CUDA - Benchmark: Half Precision pts/arrayfire-1.1.0 blas_opencl Test: BLAS OpenCL pts/clpeak-1.0.1 --compute-sp OpenCL Test: Single-Precision Float pts/clpeak-1.0.1 --compute-dp OpenCL Test: Double-Precision Double pts/vkpeak-1.0.2 fp32-scalar pts/vkpeak-1.0.2 fp32-vec4 pts/vkpeak-1.0.2 fp16-scalar pts/vkpeak-1.0.2 fp16-vec4 pts/vkpeak-1.0.2 fp64-scalar pts/vkpeak-1.0.2 fp64-vec4 pts/shoc-1.2.0 -opencl -benchmark FFT Target: OpenCL - Benchmark: FFT SP pts/shoc-1.2.0 -opencl -benchmark GEMM Target: OpenCL - Benchmark: GEMM SGEMM_N pts/shoc-1.2.0 -opencl -benchmark S3D Target: OpenCL - Benchmark: S3D pts/viennacl-1.1.0 dense_blas-bench-opencl Test: OpenCL BLAS - dGEMM-NN pts/viennacl-1.1.0 dense_blas-bench-opencl Test: OpenCL BLAS - dGEMM-NT pts/viennacl-1.1.0 dense_blas-bench-opencl Test: OpenCL BLAS - dGEMM-TN pts/viennacl-1.1.0 dense_blas-bench-opencl Test: OpenCL BLAS - dGEMM-TT pts/shoc-1.2.0 -opencl -benchmark MD5Hash Target: OpenCL - Benchmark: MD5 Hash pts/mixbench-1.1.1 mixbench-cuda-ro GIOPS Backend: NVIDIA CUDA - Benchmark: Integer pts/clpeak-1.0.1 --compute-integer OpenCL Test: Integer Compute INT pts/vkpeak-1.0.2 int32-scalar pts/vkpeak-1.0.2 int32-vec4 pts/vkpeak-1.0.2 int16-scalar pts/vkpeak-1.0.2 int16-vec4 pts/hashcat-1.0.0 -m 0 Benchmark: MD5 pts/hashcat-1.0.0 -m 100 Benchmark: SHA1 pts/hashcat-1.0.0 -m 1700 Benchmark: SHA-512 pts/hashcat-1.0.0 -m 11600 Benchmark: 7-Zip pts/hashcat-1.0.0 -m 6211 Benchmark: TrueCrypt RIPEMD160 + XTS pts/indigobench-1.1.0 --gpuonly --scenes supercar Acceleration: OpenCL GPU - Scene: Supercar pts/indigobench-1.1.0 --gpuonly --scenes bedroom Acceleration: OpenCL GPU - Scene: Bedroom pts/luxcorerender-1.3.0 DLSC/LuxCoreScene/render.cfg -D renderengine.type PATHOCL -D opencl.native.threads.count 0 -D context.cuda.optix.enable 0 Scene: DLSC - Acceleration: GPU pts/luxcorerender-1.3.0 RainbowColorsAndPrism/LuxCoreScene/render.cfg -D renderengine.type PATHOCL -D opencl.native.threads.count 0 -D context.cuda.optix.enable 0 Scene: Rainbow Colors and Prism - Acceleration: GPU pts/luxcorerender-1.3.0 LuxCore2.1Benchmark/LuxCoreScene/render.cfg -D renderengine.type PATHOCL -D opencl.native.threads.count 0 -D context.cuda.optix.enable 0 Scene: LuxCore Benchmark - Acceleration: GPU pts/luxcorerender-1.3.0 OrangeJuice/LuxCoreScene/render.cfg -D renderengine.type PATHOCL -D opencl.native.threads.count 0 -D context.cuda.optix.enable 0 Scene: Orange Juice - Acceleration: GPU pts/luxcorerender-1.3.0 DanishMood/LuxCoreScene/render.cfg -D renderengine.type PATHOCL -D opencl.native.threads.count 0 -D context.cuda.optix.enable 0 Scene: Danish Mood - Acceleration: GPU pts/lczero-1.5.1 -b opencl Backend: OpenCL pts/fahbench-1.0.2 pts/octanebench-1.3.0 Total Score pts/v-ray-1.3.0 -m vray-gpu-cuda Mode: NVIDIA CUDA GPU pts/v-ray-1.3.0 -m vray-gpu-rtx Mode: NVIDIA RTX GPU pts/namd-cuda-1.1.1 ATPase Simulation - 327,506 Atoms pts/arrayfire-1.1.0 cg_opencl Test: Conjugate Gradient OpenCL pts/vkresample-1.0.0 -u 2 -p 0 Upscale: 2x - Precision: Single pts/blender-1.9.0 -b ../bmw27_gpu.blend -o output.test -x 1 -F JPEG -f 1 CUDA Blend File: BMW27 - Compute: CUDA pts/blender-1.9.0 -b ../bmw27_gpu.blend -o output.test -x 1 -F JPEG -f 1 OPTIX Blend File: BMW27 - Compute: NVIDIA OptiX pts/blender-1.9.0 -b ../classroom_gpu.blend -o output.test -x 1 -F JPEG -f 1 CUDA Blend File: Classroom - Compute: CUDA pts/blender-1.9.0 -b ../classroom_gpu.blend -o output.test -x 1 -F JPEG -f 1 OPTIX Blend File: Classroom - Compute: NVIDIA OptiX pts/blender-1.9.0 -b ../fishy_cat_gpu.blend -o output.test -x 1 -F JPEG -f 1 CUDA Blend File: Fishy Cat - Compute: CUDA pts/blender-1.9.0 -b ../fishy_cat_gpu.blend -o output.test -x 1 -F JPEG -f 1 OPTIX Blend File: Fishy Cat - Compute: NVIDIA OptiX pts/blender-1.9.0 -b ../pavillon_barcelone_gpu.blend -o output.test -x 1 -F JPEG -f 1 CUDA Blend File: Pabellon Barcelona - Compute: CUDA pts/blender-1.9.0 -b ../pavillon_barcelone_gpu.blend -o output.test -x 1 -F JPEG -f 1 OPTIX Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX pts/blender-1.9.0 -b ../barbershop_interior_gpu.blend -o output.test -x 1 -F JPEG -f 1 CUDA Blend File: Barbershop - Compute: CUDA pts/blender-1.9.0 -b ../barbershop_interior_gpu.blend -o output.test -x 1 -F JPEG -f 1 OPTIX Blend File: Barbershop - Compute: NVIDIA OptiX pts/realsr-ncnn-1.0.0 -s 4 -x Scale: 4x - TAA: Yes pts/realsr-ncnn-1.0.0 -s 4 Scale: 4x - TAA: No pts/waifu2x-ncnn-1.0.0 -s 2 -n 3 -x Scale: 2x - Denoise: 3 - TAA: Yes pts/betsy-1.0.0 --codec=etc1 --quality=2 Codec: ETC1 - Quality: Highest pts/betsy-1.0.0 --codec=etc2_rgb --quality=2 Codec: ETC2 RGB - Quality: Highest pts/redshift-1.0.1