cuda-RTX3060M-2021nov

NVIDIA GeForce RTX 3060 Laptop GPU 6 GB (94.06.22.00.47 VBIOS) testing with a Dell G15 (model 5511, 1.7.0 BIOS) and Intel Core i7-11800H on Ubuntu 20.04.3 via the Phoronix Test Suite.

RTX 3060M:

  Processor: Intel Core i7-11800H @ 4.60GHz (8 Cores / 16 Threads), Motherboard: Dell 0YC2KJ (1.7.0 BIOS), Chipset: Intel Device 43ef, Memory: 16GB, Disk: Kioxia KBG40ZNS1T02 NVMe 1024GB, Graphics: NVIDIA GeForce RTX 3060 Laptop GPU 6GB, Audio: Intel Device 43c8, Network: Realtek Device 2600 + Intel Device 43f0

  OS: Ubuntu 20.04, Kernel: 5.14.0-1007-oem (x86_64), Desktop: GNOME Shell 3.36.9, Display Server: X Server 1.20.11, Display Driver: NVIDIA 495.44, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 11.5.100, Vulkan: 1.2.145, Compiler: GCC 10.3.0 + Clang 10.0.0-4ubuntu1, File-System: ext4, Screen Resolution: 1920x1080

vkpeak 20210424
fp32-scalar
GFLOPS > Higher Is Better
RTX 3060M . 7549.77 |==========================================================

vkpeak 20210424
fp32-vec4
GFLOPS > Higher Is Better
RTX 3060M . 9971.75 |==========================================================

vkpeak 20210424
fp16-scalar
GFLOPS > Higher Is Better
RTX 3060M . 7547.91 |==========================================================

vkpeak 20210424
fp16-vec4
GFLOPS > Higher Is Better
RTX 3060M . 14856.37 |=========================================================

vkpeak 20210424
fp64-scalar
GFLOPS > Higher Is Better
RTX 3060M . 233.56 |===========================================================

vkpeak 20210424
fp64-vec4
GFLOPS > Higher Is Better
RTX 3060M . 235.04 |===========================================================

vkpeak 20210424
int32-scalar
GIOPS > Higher Is Better
RTX 3060M . 7490.12 |==========================================================

vkpeak 20210424
int32-vec4
GIOPS > Higher Is Better
RTX 3060M . 7458.71 |==========================================================

vkpeak 20210424
int16-scalar
GIOPS > Higher Is Better
RTX 3060M . 4951.29 |==========================================================

vkpeak 20210424
int16-vec4
GIOPS > Higher Is Better
RTX 3060M . 6373.26 |==========================================================

RealSR-NCNN 20200818
Scale: 4x - TAA: No
Seconds < Lower Is Better
RTX 3060M . 11.25 |============================================================

RealSR-NCNN 20200818
Scale: 4x - TAA: Yes
Seconds < Lower Is Better
RTX 3060M . 71.80 |============================================================

Waifu2x-NCNN Vulkan 20200818
Scale: 2x - Denoise: 3 - TAA: No
Seconds < Lower Is Better

Waifu2x-NCNN Vulkan 20200818
Scale: 2x - Denoise: 3 - TAA: Yes
Seconds < Lower Is Better
RTX 3060M . 5.184 |============================================================

VkFFT 1.1.1
Benchmark Score > Higher Is Better

Hashcat 6.2.4
Benchmark: MD5
H/s > Higher Is Better
RTX 3060M . 25675266667 |======================================================

Hashcat 6.2.4
Benchmark: SHA1
H/s > Higher Is Better
RTX 3060M . 8123800000 |=======================================================

Hashcat 6.2.4
Benchmark: 7-Zip
H/s > Higher Is Better
RTX 3060M . 406433 |===========================================================

Hashcat 6.2.4
Benchmark: SHA-512
H/s > Higher Is Better
RTX 3060M . 1030500000 |=======================================================

Hashcat 6.2.4
Benchmark: TrueCrypt RIPEMD160 + XTS
H/s > Higher Is Better
RTX 3060M . 299733 |===========================================================

Mixbench 2020-06-23
Backend: OpenCL - Benchmark: Integer

Mixbench 2020-06-23
Backend: NVIDIA CUDA - Benchmark: Integer

Mixbench 2020-06-23
Backend: OpenCL - Benchmark: Double Precision

Mixbench 2020-06-23
Backend: OpenCL - Benchmark: Single Precision

Mixbench 2020-06-23
Backend: NVIDIA CUDA - Benchmark: Half Precision

Mixbench 2020-06-23
Backend: NVIDIA CUDA - Benchmark: Double Precision

Mixbench 2020-06-23
Backend: NVIDIA CUDA - Benchmark: Single Precision

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: S3D
GFLOPS > Higher Is Better
RTX 3060M . 166.17 |===========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Triad
GB/s > Higher Is Better
RTX 3060M . 12.23 |============================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: FFT SP
GFLOPS > Higher Is Better
RTX 3060M . 779.04 |===========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: MD5 Hash
GHash/s > Higher Is Better
RTX 3060M . 16.03 |============================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Reduction
GB/s > Higher Is Better
RTX 3060M . 296.18 |===========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: GEMM SGEMM_N
GFLOPS > Higher Is Better
RTX 3060M . 2787.44 |==========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Max SP Flops
GFLOPS > Higher Is Better
RTX 3060M . 14784.0 |==========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Bus Speed Download
GB/s > Higher Is Better
RTX 3060M . 12.68 |============================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Bus Speed Readback
GB/s > Higher Is Better
RTX 3060M . 13.19 |============================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Texture Read Bandwidth
GB/s > Higher Is Better
RTX 3060M . 1309.13 |==========================================================

Libplacebo 2.72.2
FPS > Higher Is Better

cl-mem 2017-01-13
Benchmark: Copy
GB/s > Higher Is Better
RTX 3060M . 262.5 |============================================================

cl-mem 2017-01-13
Benchmark: Read
GB/s > Higher Is Better
RTX 3060M . 304.5 |============================================================

cl-mem 2017-01-13
Benchmark: Write
GB/s > Higher Is Better
RTX 3060M . 300.8 |============================================================

NAMD CUDA 2.14
ATPase Simulation - 327,506 Atoms
days/ns < Lower Is Better
RTX 3060M . 0.22094 |==========================================================

Betsy GPU Compressor 1.1 Beta
Codec: ETC1 - Quality: Highest
Seconds < Lower Is Better
RTX 3060M . 6.286 |============================================================

Betsy GPU Compressor 1.1 Beta
Codec: ETC2 RGB - Quality: Highest
Seconds < Lower Is Better
RTX 3060M . 8.694 |============================================================

VkResample 1.0
Upscale: 2x - Precision: Double
ms < Lower Is Better
RTX 3060M . 327.15 |===========================================================

VkResample 1.0
Upscale: 2x - Precision: Single
ms < Lower Is Better
RTX 3060M . 24.88 |============================================================

OctaneBench 2020.1
Total Score
Score > Higher Is Better
RTX 3060M . 313.43 |===========================================================

RedShift Demo 3.0
Seconds < Lower Is Better

FAHBench 2.3.2
Ns Per Day > Higher Is Better
RTX 3060M . 195.50 |===========================================================

LeelaChessZero 0.28
Backend: OpenCL
Nodes Per Second > Higher Is Better
RTX 3060M . 25156 |============================================================

Rodinia 3.1
Test: OpenCL Particle Filter
Seconds < Lower Is Better

ArrayFire 3.7
Test: Conjugate Gradient OpenCL
ms < Lower Is Better
RTX 3060M . 2.604 |============================================================

LuxCoreRender 2.5
Scene: DLSC - Acceleration: GPU
M samples/sec > Higher Is Better
RTX 3060M . 4.60 |=============================================================

LuxCoreRender 2.5
Scene: Danish Mood - Acceleration: GPU
M samples/sec > Higher Is Better
RTX 3060M . 2.80 |=============================================================

LuxCoreRender 2.5
Scene: Orange Juice - Acceleration: GPU
M samples/sec > Higher Is Better
RTX 3060M . 4.99 |=============================================================

LuxCoreRender 2.5
Scene: LuxCore Benchmark - Acceleration: GPU
M samples/sec > Higher Is Better
RTX 3060M . 3.72 |=============================================================

LuxCoreRender 2.5
Scene: Rainbow Colors and Prism - Acceleration: GPU
M samples/sec > Higher Is Better
RTX 3060M . 15.32 |============================================================

FinanceBench 2016-07-25
Benchmark: Black-Scholes OpenCL
ms < Lower Is Better
RTX 3060M . 15.93 |============================================================

ViennaCL 1.7.1
Test: CPU BLAS - sCOPY
GB/s > Higher Is Better
RTX 3060M . 17.7 |=============================================================

ViennaCL 1.7.1
Test: CPU BLAS - sAXPY
GB/s > Higher Is Better
RTX 3060M . 27.4 |=============================================================

ViennaCL 1.7.1
Test: CPU BLAS - sDOT
GB/s > Higher Is Better
RTX 3060M . 35.5 |=============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dCOPY
GB/s > Higher Is Better
RTX 3060M . 15.2 |=============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dAXPY
GB/s > Higher Is Better
RTX 3060M . 23.4 |=============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dDOT
GB/s > Higher Is Better
RTX 3060M . 30.1 |=============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMV-N
GB/s > Higher Is Better
RTX 3060M . 40.3 |=============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMV-T
GB/s > Higher Is Better
RTX 3060M . 40.5 |=============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-NN
GFLOPs/s > Higher Is Better
RTX 3060M . 36.8 |=============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-TN
GFLOPs/s > Higher Is Better
RTX 3060M . 38.2 |=============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-TT
GFLOPs/s > Higher Is Better
RTX 3060M . 38.3 |=============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-NT
GFLOPs/s > Higher Is Better
RTX 3060M . 37.2 |=============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - sCOPY
GB/s > Higher Is Better
RTX 3060M . 265 |==============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - sAXPY
GB/s > Higher Is Better
RTX 3060M . 295 |==============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - sDOT
GB/s > Higher Is Better
RTX 3060M . 286 |==============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dCOPY
GB/s > Higher Is Better
RTX 3060M . 303 |==============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dAXPY
GB/s > Higher Is Better
RTX 3060M . 304 |==============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dDOT
GB/s > Higher Is Better
RTX 3060M . 316 |==============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMV-N
GB/s > Higher Is Better
RTX 3060M . 221 |==============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMV-T
GB/s > Higher Is Better
RTX 3060M . 292 |==============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-NN
GFLOPs/s > Higher Is Better
RTX 3060M . 224 |==============================================================

GROMACS 2021.2
Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare
Ns Per Day > Higher Is Better

Caffe 2020-02-13
Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100
Milli-Seconds < Lower Is Better

Caffe 2020-02-13
Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200
Milli-Seconds < Lower Is Better

Caffe 2020-02-13
Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000
Milli-Seconds < Lower Is Better

Caffe 2020-02-13
Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100
Milli-Seconds < Lower Is Better

Caffe 2020-02-13
Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200
Milli-Seconds < Lower Is Better

Caffe 2020-02-13
Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000
Milli-Seconds < Lower Is Better

NCNN 20210720
Target: Vulkan GPU - Model: mobilenet
ms < Lower Is Better
RTX 3060M . 4.99 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: mobilenet-v2
ms < Lower Is Better
RTX 3060M . 1.96 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: mobilenet-v3
ms < Lower Is Better
RTX 3060M . 2.47 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: shufflenet-v2
ms < Lower Is Better
RTX 3060M . 1.76 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: mnasnet
ms < Lower Is Better
RTX 3060M . 2.03 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: efficientnet-b0
ms < Lower Is Better
RTX 3060M . 3.44 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: blazeface
ms < Lower Is Better
RTX 3060M . 0.95 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: googlenet
ms < Lower Is Better
RTX 3060M . 5.42 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: vgg16
ms < Lower Is Better
RTX 3060M . 7.30 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: resnet18
ms < Lower Is Better
RTX 3060M . 1.92 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: alexnet
ms < Lower Is Better
RTX 3060M . 2.04 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: resnet50
ms < Lower Is Better
RTX 3060M . 4.33 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: yolov4-tiny
ms < Lower Is Better
RTX 3060M . 8.04 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: squeezenet_ssd
ms < Lower Is Better
RTX 3060M . 5.11 |=============================================================

NCNN 20210720
Target: Vulkan GPU - Model: regnety_400m
ms < Lower Is Better
RTX 3060M . 2.47 |=============================================================

PlaidML
FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better

PlaidML
FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL
Examples Per Second > Higher Is Better

PlaidML
FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better

PlaidML
FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better

PlaidML
FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL
Examples Per Second > Higher Is Better

IndigoBench 4.4
Acceleration: OpenCL GPU - Scene: Bedroom
M samples/s > Higher Is Better
RTX 3060M . 9.150 |============================================================

IndigoBench 4.4
Acceleration: OpenCL GPU - Scene: Supercar
M samples/s > Higher Is Better
RTX 3060M . 27.73 |============================================================

Blender 2.92
Blend File: BMW27 - Compute: CUDA
Seconds < Lower Is Better
RTX 3060M . 42.32 |============================================================

Blender 2.92
Blend File: Classroom - Compute: CUDA
Seconds < Lower Is Better
RTX 3060M . 126.06 |===========================================================

Blender 2.92
Blend File: Fishy Cat - Compute: CUDA
Seconds < Lower Is Better
RTX 3060M . 80.93 |============================================================

Blender 2.92
Blend File: Barbershop - Compute: CUDA
Seconds < Lower Is Better
RTX 3060M . 929.40 |===========================================================

Blender 2.92
Blend File: BMW27 - Compute: NVIDIA OptiX
Seconds < Lower Is Better
RTX 3060M . 26.00 |============================================================

Blender 2.92
Blend File: Classroom - Compute: NVIDIA OptiX
Seconds < Lower Is Better
RTX 3060M . 76.95 |============================================================

Blender 2.92
Blend File: Fishy Cat - Compute: NVIDIA OptiX
Seconds < Lower Is Better
RTX 3060M . 54.58 |============================================================

Blender 2.92
Blend File: Barbershop - Compute: NVIDIA OptiX
Seconds < Lower Is Better
RTX 3060M . 835.91 |===========================================================

Blender 2.92
Blend File: Pabellon Barcelona - Compute: CUDA
Seconds < Lower Is Better
RTX 3060M . 303.34 |===========================================================

Blender 2.92
Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX
Seconds < Lower Is Better
RTX 3060M . 115.23 |===========================================================

MandelGPU 1.3pts1
OpenCL Device: GPU
Samples/sec > Higher Is Better
RTX 3060M . 214129242.1 |======================================================

clpeak
OpenCL Test: Integer Compute INT
GIOPS > Higher Is Better
RTX 3060M . 6510.64 |==========================================================

clpeak
OpenCL Test: Single-Precision Float
GFLOPS > Higher Is Better
RTX 3060M . 12934.27 |=========================================================

clpeak
OpenCL Test: Double-Precision Double
GFLOPS > Higher Is Better
RTX 3060M . 237.04 |===========================================================

clpeak
OpenCL Test: Global Memory Bandwidth
GBPS > Higher Is Better
RTX 3060M . 301.35 |===========================================================

NeatBench 5
Acceleration: GPU
FPS > Higher Is Better
RTX 3060M . 1546.4 |===========================================================