d d d: Processor: Intel Celeron G4930 @ 3.20GHz (2 Cores), Motherboard: Gigabyte Z390 D (F3h BIOS), Chipset: Intel Cannon Lake PCH, Memory: 1 x 8 GB DDR4-2400MT/s, Disk: 120GB KINGSTON SA400M8, Graphics: eVGA NVIDIA GeForce RTX 3070 8GB, Audio: Realtek ALC887-VD, Network: Realtek RTL8111/8168/8411 OS: Ubuntu 22.04, Kernel: 6.2.0-39-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server 1.21.1.4, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Vulkan: 1.3.242, Compiler: GCC 11.4.0, File-System: ext4, Screen Resolution: 1280x768 vkpeak 20230730 GFLOPS > Higher Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No Seconds < Lower Is Better d . 11.18 |==================================================================== RealSR-NCNN 20200818 Scale: 4x - TAA: Yes Seconds < Lower Is Better d . 52.28 |==================================================================== Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: No Seconds < Lower Is Better d . 2.304 |==================================================================== Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes Seconds < Lower Is Better d . 5.101 |==================================================================== VkFFT 1.2.31 Test: FFT + iFFT R2C / C2R Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C 1D batched in half precision Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C Bluestein in single precision Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C 1D batched in double precision Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C 1D batched in single precision Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C multidimensional in single precision Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C Bluestein benchmark in double precision Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling Benchmark Score > Higher Is Better Hashcat 6.2.4 Benchmark: MD5 H/s > Higher Is Better d . 41124633333 |============================================================== Hashcat 6.2.4 Benchmark: SHA1 H/s > Higher Is Better d . 12826400000 |============================================================== Hashcat 6.2.4 Benchmark: 7-Zip H/s > Higher Is Better d . 610147 |=================================================================== Hashcat 6.2.4 Benchmark: SHA-512 H/s > Higher Is Better d . 1622266667 |=============================================================== Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better d . 480100 |=================================================================== Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Integer Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Half Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Double Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Single Precision SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D GFLOPS > Higher Is Better d . 220.12 |=================================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad GB/s > Higher Is Better d . 12.40 |==================================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP GFLOPS > Higher Is Better d . 1136.12 |================================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash GHash/s > Higher Is Better d . 25.22 |==================================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction GB/s > Higher Is Better d . 334.04 |=================================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N GFLOPS > Higher Is Better d . 3784.38 |================================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops GFLOPS > Higher Is Better d . 22675.5 |================================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download GB/s > Higher Is Better d . 12.89 |==================================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback GB/s > Higher Is Better d . 13.18 |==================================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth GB/s > Higher Is Better d . 2086.29 |================================================================== Libplacebo 5.229.1 FPS > Higher Is Better cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better d . 292.6 |==================================================================== cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better d . 391.6 |==================================================================== cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better d . 386.4 |==================================================================== NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms days/ns < Lower Is Better d . 0.62104 |================================================================== Betsy GPU Compressor 1.1 Beta Codec: ETC1 - Quality: Highest Seconds < Lower Is Better Betsy GPU Compressor 1.1 Beta Codec: ETC2 RGB - Quality: Highest Seconds < Lower Is Better VkResample 1.0 Upscale: 2x - Precision: Double ms < Lower Is Better d . 500.00 |=================================================================== VkResample 1.0 Upscale: 2x - Precision: Single ms < Lower Is Better d . 20.51 |==================================================================== OctaneBench 2020.1 Total Score Score > Higher Is Better d . 405.39 |=================================================================== RedShift Demo 3.0 Seconds < Lower Is Better FAHBench 2.3.2 Ns Per Day > Higher Is Better d . 239.32 |=================================================================== clpeak 1.1.2 OpenCL Test: Integer Compute INT GIOPS > Higher Is Better d . 10247.32 |================================================================= clpeak 1.1.2 OpenCL Test: Single-Precision Float GFLOPS > Higher Is Better d . 19869.82 |================================================================= clpeak 1.1.2 OpenCL Test: Double-Precision Double GFLOPS > Higher Is Better d . 350.02 |=================================================================== clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better d . 390.74 |=================================================================== LeelaChessZero 0.28 Backend: OpenCL Nodes Per Second > Higher Is Better d . 16653 |==================================================================== Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better d . 6.293 |==================================================================== LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU M samples/sec > Higher Is Better d . 6.61 |===================================================================== LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU M samples/sec > Higher Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU M samples/sec > Higher Is Better d . 6.34 |===================================================================== LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU M samples/sec > Higher Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU M samples/sec > Higher Is Better d . 18.63 |==================================================================== ArrayFire 3.9 Test: Conjugate Gradient OpenCL ms < Lower Is Better d . 2.444 |==================================================================== FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better d . 9.734 |==================================================================== ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better d . 8.48 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better d . 12.57 |==================================================================== ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better d . 10.28 |==================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better d . 8.70 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better d . 13.2 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better d . 13.54 |==================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better d . 13.9 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better d . 10.30 |==================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better d . 4.14 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better d . 4.04 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better d . 4.22 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better d . 4.26 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better d . 291 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better d . 354 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better d . 293 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better d . 366 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better d . 395 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better d . 390 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better d . 175 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better d . 316 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better d . 333 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT GFLOPs/s > Higher Is Better d . 332 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better d . 333 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GFLOPs/s > Higher Is Better d . 332 |====================================================================== Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better NCNN 20230517 Target: Vulkan GPU - Model: mobilenet ms < Lower Is Better d . 104.44 |=================================================================== NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better d . 31.02 |==================================================================== NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better d . 24.79 |==================================================================== NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 ms < Lower Is Better d . 13.69 |==================================================================== NCNN 20230517 Target: Vulkan GPU - Model: mnasnet ms < Lower Is Better d . 27.79 |==================================================================== NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 ms < Lower Is Better d . 47.48 |==================================================================== NCNN 20230517 Target: Vulkan GPU - Model: blazeface ms < Lower Is Better d . 4.31 |===================================================================== NCNN 20230517 Target: Vulkan GPU - Model: googlenet ms < Lower Is Better d . 81.73 |==================================================================== NCNN 20230517 Target: Vulkan GPU - Model: vgg16 ms < Lower Is Better d . 387.49 |=================================================================== NCNN 20230517 Target: Vulkan GPU - Model: resnet18 ms < Lower Is Better d . 72.26 |==================================================================== NCNN 20230517 Target: Vulkan GPU - Model: alexnet ms < Lower Is Better d . 56.42 |==================================================================== NCNN 20230517 Target: Vulkan GPU - Model: resnet50 ms < Lower Is Better d . 190.50 |=================================================================== NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny ms < Lower Is Better d . 133.84 |=================================================================== NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd ms < Lower Is Better d . 55.71 |==================================================================== NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m ms < Lower Is Better d . 35.92 |==================================================================== NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer ms < Lower Is Better d . 2437.71 |================================================================== NCNN 20230517 Target: Vulkan GPU - Model: FastestDet ms < Lower Is Better d . 15.92 |==================================================================== PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL Examples Per Second > Higher Is Better Blender 4.0 Blend File: BMW27 - Compute: NVIDIA OptiX Seconds < Lower Is Better d . 11.48 |==================================================================== Blender 4.0 Blend File: Classroom - Compute: NVIDIA OptiX Seconds < Lower Is Better d . 27.07 |==================================================================== Blender 4.0 Blend File: Fishy Cat - Compute: NVIDIA OptiX Seconds < Lower Is Better d . 20.93 |==================================================================== Blender 4.0 Blend File: Barbershop - Compute: NVIDIA OptiX Seconds < Lower Is Better Blender 4.0 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Seconds < Lower Is Better d . 31.00 |==================================================================== IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Bedroom M samples/s > Higher Is Better d . 13.05 |==================================================================== IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Supercar M samples/s > Higher Is Better d . 35.68 |==================================================================== MandelGPU 1.3pts1 OpenCL Device: GPU Samples/sec > Higher Is Better d . 290490729.3 |============================================================== NeatBench 5 Acceleration: GPU FPS > Higher Is Better d . 3070 |=====================================================================