nv-test-run nvidia test run nvidia test run: Processor: AMD Ryzen 9 5900 12-Core @ 3.00GHz (12 Cores / 24 Threads), Motherboard: Alienware 0TYR0X (2.1.2 BIOS), Chipset: AMD Starship/Matisse, Memory: 32GB, Disk: Kioxia KXG70PNV2T04 NVMe 2048GB, Graphics: NVIDIA GeForce RTX 3080 10GB, Audio: NVIDIA Device 1aef, Monitor: LEN L24q-30, Network: Realtek Device 2600 + Intel Wi-Fi 6 AX200 OS: Ubuntu 20.04, Kernel: 5.8.0-63-generic (x86_64), Desktop: GNOME Shell 3.36.9, Display Server: X Server 1.20.9, Display Driver: NVIDIA 470.57.02, OpenCL: OpenCL 3.0 CUDA 11.4.94, Vulkan: 1.2.175, Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 2560x1440 vkpeak 20210424 fp32-scalar GFLOPS > Higher Is Better nvidia test run . 17147.74 |=================================================== vkpeak 20210424 fp32-vec4 GFLOPS > Higher Is Better nvidia test run . 22559.76 |=================================================== vkpeak 20210424 fp16-scalar GFLOPS > Higher Is Better nvidia test run . 17093.14 |=================================================== vkpeak 20210424 fp16-vec4 GFLOPS > Higher Is Better nvidia test run . 33156.88 |=================================================== vkpeak 20210424 fp64-scalar GFLOPS > Higher Is Better nvidia test run . 532.86 |===================================================== vkpeak 20210424 fp64-vec4 GFLOPS > Higher Is Better nvidia test run . 533.37 |===================================================== vkpeak 20210424 int32-scalar GIOPS > Higher Is Better nvidia test run . 17022.36 |=================================================== vkpeak 20210424 int32-vec4 GIOPS > Higher Is Better nvidia test run . 16945.40 |=================================================== vkpeak 20210424 int16-scalar GIOPS > Higher Is Better nvidia test run . 11203.87 |=================================================== vkpeak 20210424 int16-vec4 GIOPS > Higher Is Better nvidia test run . 13785.17 |=================================================== RealSR-NCNN 20200818 Scale: 4x - TAA: No Seconds < Lower Is Better nvidia test run . 6.363 |====================================================== RealSR-NCNN 20200818 Scale: 4x - TAA: Yes Seconds < Lower Is Better nvidia test run . 33.78 |====================================================== Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes Seconds < Lower Is Better nvidia test run . 3.519 |====================================================== VkFFT 1.1.1 Benchmark Score > Higher Is Better nvidia test run . 41236 |====================================================== Hashcat 6.1.1 Benchmark: MD5 H/s > Higher Is Better nvidia test run . 54531666667 |================================================ Hashcat 6.1.1 Benchmark: SHA1 H/s > Higher Is Better nvidia test run . 18624600000 |================================================ Hashcat 6.1.1 Benchmark: 7-Zip H/s > Higher Is Better nvidia test run . 955167 |===================================================== Hashcat 6.1.1 Benchmark: SHA-512 H/s > Higher Is Better nvidia test run . 2354766667 |================================================= Hashcat 6.1.1 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better nvidia test run . 702933 |===================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D GFLOPS > Higher Is Better nvidia test run . 338.63 |===================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad GB/s > Higher Is Better nvidia test run . 13.14 |====================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP GFLOPS > Higher Is Better nvidia test run . 1918.08 |==================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash GHash/s > Higher Is Better nvidia test run . 36.79 |====================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction GB/s > Higher Is Better nvidia test run . 385.29 |===================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N GFLOPS > Higher Is Better nvidia test run . 6265.63 |==================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops GFLOPS > Higher Is Better nvidia test run . 33021.0 |==================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download GB/s > Higher Is Better nvidia test run . 13.44 |====================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback GB/s > Higher Is Better nvidia test run . 13.53 |====================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth GB/s > Higher Is Better nvidia test run . 2184.00 |==================================================== cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better nvidia test run . 354.4 |====================================================== cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better nvidia test run . 672.5 |====================================================== cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better nvidia test run . 648.7 |====================================================== NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms days/ns < Lower Is Better nvidia test run . 0.14201 |==================================================== Betsy GPU Compressor 1.1 Beta Codec: ETC1 - Quality: Highest Seconds < Lower Is Better nvidia test run . 3.409 |====================================================== Betsy GPU Compressor 1.1 Beta Codec: ETC2 RGB - Quality: Highest Seconds < Lower Is Better nvidia test run . 4.589 |====================================================== VkResample 1.0 Upscale: 2x - Precision: Double ms < Lower Is Better nvidia test run . 144.99 |===================================================== VkResample 1.0 Upscale: 2x - Precision: Single ms < Lower Is Better nvidia test run . 11.27 |====================================================== OctaneBench 2020.1 Total Score Score > Higher Is Better nvidia test run . 566.50 |===================================================== FAHBench 2.3.2 Ns Per Day > Higher Is Better nvidia test run . 315.49 |===================================================== Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better nvidia test run . 4.356 |====================================================== ArrayFire 3.7 Test: Conjugate Gradient OpenCL ms < Lower Is Better nvidia test run . 1.576 |====================================================== LuxCoreRender 2.5 Scene: DLSC - Acceleration: GPU M samples/sec > Higher Is Better nvidia test run . 9.49 |======================================================= LuxCoreRender 2.5 Scene: Danish Mood - Acceleration: GPU M samples/sec > Higher Is Better nvidia test run . 6.40 |======================================================= LuxCoreRender 2.5 Scene: Orange Juice - Acceleration: GPU M samples/sec > Higher Is Better nvidia test run . 9.08 |======================================================= LuxCoreRender 2.5 Scene: LuxCore Benchmark - Acceleration: GPU M samples/sec > Higher Is Better nvidia test run . 8.25 |======================================================= LuxCoreRender 2.5 Scene: Rainbow Colors and Prism - Acceleration: GPU M samples/sec > Higher Is Better nvidia test run . 26.73 |====================================================== FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better nvidia test run . 7.346 |====================================================== ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better nvidia test run . 62.5 |======================================================= ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better nvidia test run . 93.8 |======================================================= ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better nvidia test run . 146 |======================================================== ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better nvidia test run . 22.9 |======================================================= ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better nvidia test run . 34.3 |======================================================= ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better nvidia test run . 44.9 |======================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better nvidia test run . 76.3 |======================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better nvidia test run . 83.6 |======================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better nvidia test run . 40.3 |======================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better nvidia test run . 42.8 |======================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better nvidia test run . 42.0 |======================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better nvidia test run . 40.0 |======================================================= ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better nvidia test run . 353 |======================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better nvidia test run . 470 |======================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better nvidia test run . 371 |======================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better nvidia test run . 555 |======================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better nvidia test run . 628 |======================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better nvidia test run . 596 |======================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better nvidia test run . 240 |======================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better nvidia test run . 374 |======================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better nvidia test run . 498 |======================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GFLOPs/s > Higher Is Better nvidia test run . 499 |======================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better nvidia test run . 496 |======================================================== NCNN 20210525 Target: Vulkan GPU - Model: mobilenet ms < Lower Is Better nvidia test run . 14.28 |====================================================== NCNN 20210525 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better nvidia test run . 4.68 |======================================================= NCNN 20210525 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better nvidia test run . 4.61 |======================================================= NCNN 20210525 Target: Vulkan GPU - Model: shufflenet-v2 ms < Lower Is Better nvidia test run . 4.67 |======================================================= NCNN 20210525 Target: Vulkan GPU - Model: mnasnet ms < Lower Is Better nvidia test run . 4.25 |======================================================= NCNN 20210525 Target: Vulkan GPU - Model: efficientnet-b0 ms < Lower Is Better nvidia test run . 6.02 |======================================================= NCNN 20210525 Target: Vulkan GPU - Model: blazeface ms < Lower Is Better nvidia test run . 1.94 |======================================================= NCNN 20210525 Target: Vulkan GPU - Model: googlenet ms < Lower Is Better nvidia test run . 15.35 |====================================================== NCNN 20210525 Target: Vulkan GPU - Model: vgg16 ms < Lower Is Better nvidia test run . 57.59 |====================================================== NCNN 20210525 Target: Vulkan GPU - Model: resnet18 ms < Lower Is Better nvidia test run . 15.52 |====================================================== NCNN 20210525 Target: Vulkan GPU - Model: alexnet ms < Lower Is Better nvidia test run . 12.29 |====================================================== NCNN 20210525 Target: Vulkan GPU - Model: resnet50 ms < Lower Is Better nvidia test run . 25.38 |====================================================== NCNN 20210525 Target: Vulkan GPU - Model: yolov4-tiny ms < Lower Is Better nvidia test run . 25.38 |====================================================== NCNN 20210525 Target: Vulkan GPU - Model: squeezenet_ssd ms < Lower Is Better nvidia test run . 16.95 |====================================================== NCNN 20210525 Target: Vulkan GPU - Model: regnety_400m ms < Lower Is Better nvidia test run . 11.74 |====================================================== IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Bedroom M samples/s > Higher Is Better nvidia test run . 17.51 |====================================================== IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Supercar M samples/s > Higher Is Better nvidia test run . 46.15 |====================================================== Blender 2.92 Blend File: BMW27 - Compute: CUDA Seconds < Lower Is Better nvidia test run . 22.39 |====================================================== Blender 2.92 Blend File: Classroom - Compute: CUDA Seconds < Lower Is Better nvidia test run . 61.46 |====================================================== Blender 2.92 Blend File: Fishy Cat - Compute: CUDA Seconds < Lower Is Better nvidia test run . 42.34 |====================================================== Blender 2.92 Blend File: Barbershop - Compute: CUDA Seconds < Lower Is Better nvidia test run . 448.21 |===================================================== Blender 2.92 Blend File: BMW27 - Compute: NVIDIA OptiX Seconds < Lower Is Better nvidia test run . 14.37 |====================================================== Blender 2.92 Blend File: Classroom - Compute: NVIDIA OptiX Seconds < Lower Is Better nvidia test run . 34.36 |====================================================== Blender 2.92 Blend File: Fishy Cat - Compute: NVIDIA OptiX Seconds < Lower Is Better nvidia test run . 25.89 |====================================================== Blender 2.92 Blend File: Barbershop - Compute: NVIDIA OptiX Seconds < Lower Is Better nvidia test run . 375.95 |===================================================== Blender 2.92 Blend File: Pabellon Barcelona - Compute: CUDA Seconds < Lower Is Better nvidia test run . 146.68 |===================================================== Blender 2.92 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Seconds < Lower Is Better nvidia test run . 52.49 |====================================================== MandelGPU 1.3pts1 OpenCL Device: GPU Samples/sec > Higher Is Better nvidia test run . 413842119.7 |================================================ clpeak OpenCL Test: Integer Compute INT GIOPS > Higher Is Better nvidia test run . 15263.57 |=================================================== clpeak OpenCL Test: Single-Precision Float GFLOPS > Higher Is Better nvidia test run . 29406.19 |=================================================== clpeak OpenCL Test: Double-Precision Double GFLOPS > Higher Is Better nvidia test run . 538.30 |===================================================== clpeak OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better nvidia test run . 662.28 |===================================================== NeatBench 5 Acceleration: GPU FPS > Higher Is Better nvidia test run . 3080 |=======================================================