FirstRun Test

GPUFirstRun:

  Processor: 2 x AMD EPYC 7763 64-Core @ 2.45GHz (128 Cores / 256 Threads), Motherboard: Supermicro H12DSG-O-CPU (2.5 BIOS), Chipset: AMD Starship/Matisse, Memory: 512GB, Disk: 30724GB MR9560-16i + 480GB MR9560-16i, Graphics: ASPEED 45GB, Network: 2 x Intel 10G X550T + 2 x Intel I350

  OS: Ubuntu 22.04, Kernel: 5.15.0-88-generic (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.3.68, Vulkan: 1.3.260, Compiler: GCC 11.4.0, File-System: ext4, Screen Resolution: 1024x768

PlaidML
FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better

PlaidML
FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL
Examples Per Second > Higher Is Better

PlaidML
FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better

PlaidML
FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better

PlaidML
FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL
Examples Per Second > Higher Is Better

LeelaChessZero 0.28
Backend: OpenCL
Nodes Per Second > Higher Is Better
GPUFirstRun . 6142 |===========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: S3D
GFLOPS > Higher Is Better
GPUFirstRun . 529.43 |=========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Triad
GB/s > Higher Is Better
GPUFirstRun . 25.57 |==========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: FFT SP
GFLOPS > Higher Is Better
GPUFirstRun . 2133.47 |========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: MD5 Hash
GHash/s > Higher Is Better
GPUFirstRun . 99.61 |==========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Reduction
GB/s > Higher Is Better
GPUFirstRun . 875.83 |=========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: GEMM SGEMM_N
GFLOPS > Higher Is Better
GPUFirstRun . 23867.2 |========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Max SP Flops
GFLOPS > Higher Is Better
GPUFirstRun . 87477.7 |========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Bus Speed Download
GB/s > Higher Is Better
GPUFirstRun . 26.71 |==========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Bus Speed Readback
GB/s > Higher Is Better
GPUFirstRun . 27.13 |==========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Texture Read Bandwidth
GB/s > Higher Is Better
GPUFirstRun . 2704.74 |========================================================

NCNN 20230517
Target: Vulkan GPU - Model: mobilenet
ms < Lower Is Better
GPUFirstRun . 116.77 |=========================================================

NCNN 20230517
Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2
ms < Lower Is Better
GPUFirstRun . 69.55 |==========================================================

NCNN 20230517
Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3
ms < Lower Is Better
GPUFirstRun . 54.22 |==========================================================

NCNN 20230517
Target: Vulkan GPU - Model: shufflenet-v2
ms < Lower Is Better
GPUFirstRun . 42.15 |==========================================================

NCNN 20230517
Target: Vulkan GPU - Model: mnasnet
ms < Lower Is Better
GPUFirstRun . 41.20 |==========================================================

NCNN 20230517
Target: Vulkan GPU - Model: efficientnet-b0
ms < Lower Is Better
GPUFirstRun . 89.70 |==========================================================

NCNN 20230517
Target: Vulkan GPU - Model: blazeface
ms < Lower Is Better
GPUFirstRun . 126.44 |=========================================================

NCNN 20230517
Target: Vulkan GPU - Model: googlenet
ms < Lower Is Better
GPUFirstRun . 211.62 |=========================================================

NCNN 20230517
Target: Vulkan GPU - Model: vgg16
ms < Lower Is Better
GPUFirstRun . 96.69 |==========================================================

NCNN 20230517
Target: Vulkan GPU - Model: resnet18
ms < Lower Is Better
GPUFirstRun . 42.48 |==========================================================

NCNN 20230517
Target: Vulkan GPU - Model: alexnet
ms < Lower Is Better
GPUFirstRun . 37.06 |==========================================================

NCNN 20230517
Target: Vulkan GPU - Model: resnet50
ms < Lower Is Better
GPUFirstRun . 256.83 |=========================================================

NCNN 20230517
Target: Vulkan GPU - Model: yolov4-tiny
ms < Lower Is Better
GPUFirstRun . 73.26 |==========================================================

NCNN 20230517
Target: Vulkan GPU - Model: squeezenet_ssd
ms < Lower Is Better
GPUFirstRun . 494.16 |=========================================================

NCNN 20230517
Target: Vulkan GPU - Model: regnety_400m
ms < Lower Is Better
GPUFirstRun . 630.73 |=========================================================

NCNN 20230517
Target: Vulkan GPU - Model: vision_transformer
ms < Lower Is Better
GPUFirstRun . 190.11 |=========================================================

NCNN 20230517
Target: Vulkan GPU - Model: FastestDet
ms < Lower Is Better
GPUFirstRun . 98.48 |==========================================================

Rodinia 3.1
Test: OpenCL Particle Filter
Seconds < Lower Is Better
GPUFirstRun . 3.204 |==========================================================

ArrayFire 3.7
Test: Conjugate Gradient OpenCL
ms < Lower Is Better
GPUFirstRun . 1.260 |==========================================================

NeatBench 5
Acceleration: GPU
FPS > Higher Is Better
GPUFirstRun . 125 |============================================================

LuxCoreRender 2.6
Scene: DLSC - Acceleration: GPU
M samples/sec > Higher Is Better
GPUFirstRun . 52.90 |==========================================================

LuxCoreRender 2.6
Scene: Danish Mood - Acceleration: GPU
M samples/sec > Higher Is Better
GPUFirstRun . 43.17 |==========================================================

LuxCoreRender 2.6
Scene: Orange Juice - Acceleration: GPU
M samples/sec > Higher Is Better
GPUFirstRun . 45.05 |==========================================================

LuxCoreRender 2.6
Scene: LuxCore Benchmark - Acceleration: GPU
M samples/sec > Higher Is Better
GPUFirstRun . 31.73 |==========================================================

LuxCoreRender 2.6
Scene: Rainbow Colors and Prism - Acceleration: GPU
M samples/sec > Higher Is Better
GPUFirstRun . 78.08 |==========================================================

FAHBench 2.3.2
Ns Per Day > Higher Is Better
GPUFirstRun . 407.49 |=========================================================

Hashcat 6.2.4
Benchmark: MD5
H/s > Higher Is Better
GPUFirstRun . 490100000000 |===================================================

Hashcat 6.2.4
Benchmark: SHA1
H/s > Higher Is Better
GPUFirstRun . 106311306250 |===================================================

Hashcat 6.2.4
Benchmark: 7-Zip
H/s > Higher Is Better
GPUFirstRun . 7872533 |========================================================

Hashcat 6.2.4
Benchmark: SHA-512
H/s > Higher Is Better
GPUFirstRun . 20675766667 |====================================================

Hashcat 6.2.4
Benchmark: TrueCrypt RIPEMD160 + XTS
H/s > Higher Is Better
GPUFirstRun . 6267933 |========================================================

Mixbench 2020-06-23
Backend: OpenCL - Benchmark: Integer

Mixbench 2020-06-23
Backend: OpenCL - Benchmark: Double Precision

Mixbench 2020-06-23
Backend: OpenCL - Benchmark: Single Precision

RedShift Demo 3.0
Seconds < Lower Is Better

FinanceBench 2016-07-25
Benchmark: Black-Scholes OpenCL
ms < Lower Is Better
GPUFirstRun . 3.008 |==========================================================

cl-mem 2017-01-13
Benchmark: Copy
GB/s > Higher Is Better
GPUFirstRun . 354.3 |==========================================================

cl-mem 2017-01-13
Benchmark: Read
GB/s > Higher Is Better
GPUFirstRun . 697.7 |==========================================================

cl-mem 2017-01-13
Benchmark: Write
GB/s > Higher Is Better
GPUFirstRun . 450.0 |==========================================================

clpeak 1.1.2
OpenCL Test: Integer Compute INT
GIOPS > Higher Is Better
GPUFirstRun . 41093.44 |=======================================================

clpeak 1.1.2
OpenCL Test: Single-Precision Float
GFLOPS > Higher Is Better
GPUFirstRun . 83524.25 |=======================================================

clpeak 1.1.2
OpenCL Test: Double-Precision Double
GFLOPS > Higher Is Better
GPUFirstRun . 1413.46 |========================================================

clpeak 1.1.2
OpenCL Test: Global Memory Bandwidth
GBPS > Higher Is Better
GPUFirstRun . 670.38 |=========================================================

MandelGPU 1.3pts1
OpenCL Device: GPU
Samples/sec > Higher Is Better

ViennaCL 1.7.1
Test: CPU BLAS - sCOPY
GB/s > Higher Is Better
GPUFirstRun . 454 |============================================================

ViennaCL 1.7.1
Test: CPU BLAS - sAXPY
GB/s > Higher Is Better
GPUFirstRun . 571 |============================================================

ViennaCL 1.7.1
Test: CPU BLAS - sDOT
GB/s > Higher Is Better
GPUFirstRun . 430 |============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dCOPY
GB/s > Higher Is Better
GPUFirstRun . 640 |============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dAXPY
GB/s > Higher Is Better
GPUFirstRun . 935 |============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dDOT
GB/s > Higher Is Better
GPUFirstRun . 695 |============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMV-N
GB/s > Higher Is Better
GPUFirstRun . 32.9 |===========================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMV-T
GB/s > Higher Is Better
GPUFirstRun . 709 |============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-NN
GFLOPs/s > Higher Is Better
GPUFirstRun . 79.2 |===========================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-NT
GFLOPs/s > Higher Is Better
GPUFirstRun . 78.9 |===========================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-TN
GFLOPs/s > Higher Is Better
GPUFirstRun . 86.1 |===========================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-TT
GFLOPs/s > Higher Is Better
GPUFirstRun . 84.4 |===========================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - sCOPY
GB/s > Higher Is Better
GPUFirstRun . 878 |============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - sAXPY
GB/s > Higher Is Better
GPUFirstRun . 1250 |===========================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - sDOT
GB/s > Higher Is Better
GPUFirstRun . 768 |============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dCOPY
GB/s > Higher Is Better
GPUFirstRun . 547 |============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dAXPY
GB/s > Higher Is Better
GPUFirstRun . 624 |============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dDOT
GB/s > Higher Is Better
GPUFirstRun . 619 |============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMV-N
GB/s > Higher Is Better
GPUFirstRun . 196 |============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMV-T
GB/s > Higher Is Better
GPUFirstRun . 394 |============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-NN
GFLOPs/s > Higher Is Better
GPUFirstRun . 1183 |===========================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-NT
GFLOPs/s > Higher Is Better
GPUFirstRun . 1190 |===========================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-TN
GFLOPs/s > Higher Is Better
GPUFirstRun . 1180 |===========================================================