heikows3-2023-08-18-nvidia-gpu-compute AMD EPYC 7313 16-Core testing with a GIGABYTE MZE2-G10-00 v01010101 (M07 BIOS) and ASPEED 45GB on Debian 12 via the Phoronix Test Suite. heikows3-2023-08-18-nvidia-gpu-compute: Processor: AMD EPYC 7313 16-Core @ 3.00GHz (16 Cores / 32 Threads), Motherboard: GIGABYTE MZE2-G10-00 v01010101 (M07 BIOS), Chipset: AMD Starship/Matisse, Memory: 8 x 32 GB DDR4-3200MT/s 36ASF4G72PZ-3G2E7, Disk: 7682GB Micron_7450_MTFDKCC7T6TFR + 1920GB Micron_7450_MTFDKBG1T9TFR, Graphics: ASPEED 45GB, Network: 2 x Intel I350 OS: Debian 12, Kernel: 6.2.16-3-pve (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.2.79, Compiler: GCC 12.2.0, File-System: ext4, Screen Resolution: 640x480 SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops GFLOPS > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 87811.9 |============================= NCNN 20230517 Target: Vulkan GPU - Model: FastestDet ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 8.40 |================================ NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 98.46 |=============================== NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 15.38 |=============================== NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 12.44 |=============================== NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 24.55 |=============================== NCNN 20230517 Target: Vulkan GPU - Model: resnet50 ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 16.37 |=============================== NCNN 20230517 Target: Vulkan GPU - Model: alexnet ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 7.88 |================================ NCNN 20230517 Target: Vulkan GPU - Model: resnet18 ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 8.71 |================================ NCNN 20230517 Target: Vulkan GPU - Model: vgg16 ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 30.29 |=============================== NCNN 20230517 Target: Vulkan GPU - Model: googlenet ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 14.86 |=============================== NCNN 20230517 Target: Vulkan GPU - Model: blazeface ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 2.52 |================================ NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 7.87 |================================ NCNN 20230517 Target: Vulkan GPU - Model: mnasnet ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 5.33 |================================ NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 7.41 |================================ NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 6.33 |================================ NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 5.27 |================================ NCNN 20230517 Target: Vulkan GPU - Model: mobilenet ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 17.00 |=============================== LeelaChessZero 0.28 Backend: OpenCL Nodes Per Second > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 37362 |=============================== LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU M samples/sec > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 12.82 |=============================== LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU M samples/sec > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 14.66 |=============================== FAHBench 2.3.2 Ns Per Day > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 395.17 |============================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 71.8 |================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 74.1 |================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 69.5 |================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 73.7 |================================ ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 233 |================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 202.0 |=============================== ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 313.4 |=============================== ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 411 |================================= ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 274 |================================= ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 252 |================================= ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 1042 |================================ ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 764 |================================= LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU M samples/sec > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 12.72 |=============================== LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU M samples/sec > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 12.87 |=============================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 1180 |================================ ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 1190 |================================ ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 394 |================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 196 |================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 621 |================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 624 |================================= ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 546 |================================= ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 774 |================================= ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 1253 |================================ ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 879 |================================= ArrayFire 3.7 Test: Conjugate Gradient OpenCL ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 1.127 |=============================== Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 2.137 |=============================== LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU M samples/sec > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 27.32 |=============================== clpeak 1.1.2 OpenCL Test: Double-Precision Double GFLOPS > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 1412.32 |============================= Hashcat 6.2.4 Benchmark: SHA-512 H/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 5317300000 |========================== Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 1464567 |============================= Hashcat 6.2.4 Benchmark: SHA1 H/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 43759733333 |========================= Hashcat 6.2.4 Benchmark: MD5 H/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 125566666667 |======================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 2715.40 |============================= Hashcat 6.2.4 Benchmark: 7-Zip H/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 2064433 |============================= cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 354.2 |=============================== cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 449.8 |=============================== cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 697.2 |=============================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D GFLOPS > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 527.28 |============================== NeatBench 5 Acceleration: GPU FPS > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 126 |================================= clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 670.25 |============================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 26.31 |=============================== FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better heikows3-2023-08-18-nvidia-gpu-compute . 2.791 |=============================== MandelGPU 1.3pts1 OpenCL Device: GPU Samples/sec > Higher Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 25.18 |=============================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP GFLOPS > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 2134.90 |============================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N GFLOPS > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 23502.1 |============================= clpeak 1.1.2 OpenCL Test: Integer Compute INT GIOPS > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 40540.41 |============================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 26.76 |=============================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction GB/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 877.57 |============================== clpeak 1.1.2 OpenCL Test: Single-Precision Float GFLOPS > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 82945.32 |============================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash GHash/s > Higher Is Better heikows3-2023-08-18-nvidia-gpu-compute . 99.47 |=============================== PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better RedShift Demo 3.0 Seconds < Lower Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer