rocky-gpu-2023-08-17 AMD Ryzen 9 7950X 16-Core testing with a Gigabyte B650 AORUS ELITE AX (F4b BIOS) and NVIDIA GeForce RTX 4090 24GB on Ubuntu 22.04 via the Phoronix Test Suite. rocky-gpu-2023-08-17: Processor: AMD Ryzen 9 7950X 16-Core @ 4.50GHz (16 Cores / 32 Threads), Motherboard: Gigabyte B650 AORUS ELITE AX (F4b BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32 GB DDR5-6000MT/s F5-6000J3238G32G, Disk: 2000GB Samsung SSD 990 PRO 2TB + 2 x 18000GB TOSHIBA MG09ACA1, Graphics: NVIDIA GeForce RTX 4090 24GB, Audio: NVIDIA Device 22ba, Network: Realtek RTL8125 2.5GbE + MEDIATEK Device 0616 OS: Ubuntu 22.04, Kernel: 5.15.0-79-generic (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.2.128, Vulkan: 1.3.242, Compiler: GCC 11.4.0 + CUDA 11.7, File-System: ext4 ArrayFire 3.7 Test: Conjugate Gradient OpenCL ms < Lower Is Better rocky-gpu-2023-08-17 . 0.8745 |================================================ Blender 3.6 Blend File: BMW27 - Compute: NVIDIA OptiX Seconds < Lower Is Better rocky-gpu-2023-08-17 . 3.45 |================================================== Blender 3.6 Blend File: Classroom - Compute: NVIDIA OptiX Seconds < Lower Is Better rocky-gpu-2023-08-17 . 7.07 |================================================== Blender 3.6 Blend File: Fishy Cat - Compute: NVIDIA OptiX Seconds < Lower Is Better rocky-gpu-2023-08-17 . 5.31 |================================================== Blender 3.6 Blend File: Barbershop - Compute: NVIDIA OptiX Seconds < Lower Is Better rocky-gpu-2023-08-17 . 29.34 |================================================= Blender 3.6 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Seconds < Lower Is Better rocky-gpu-2023-08-17 . 8.10 |================================================== Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better rocky-gpu-2023-08-17 . 409.1 |================================================= cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better rocky-gpu-2023-08-17 . 889.2 |================================================= cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better rocky-gpu-2023-08-17 . 806.9 |================================================= clpeak 1.1.2 OpenCL Test: Integer Compute INT GIOPS > Higher Is Better rocky-gpu-2023-08-17 . 40829.93 |============================================== clpeak 1.1.2 OpenCL Test: Single-Precision Float GFLOPS > Higher Is Better rocky-gpu-2023-08-17 . 79753.35 |============================================== clpeak 1.1.2 OpenCL Test: Double-Precision Double GFLOPS > Higher Is Better rocky-gpu-2023-08-17 . 1388.89 |=============================================== clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better rocky-gpu-2023-08-17 . 873.15 |================================================ FAHBench 2.3.2 Ns Per Day > Higher Is Better rocky-gpu-2023-08-17 . 433.12 |================================================ FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better rocky-gpu-2023-08-17 . 2.895 |================================================= GROMACS 2023 Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better Hashcat 6.2.4 Benchmark: MD5 H/s > Higher Is Better Hashcat 6.2.4 Benchmark: SHA1 H/s > Higher Is Better Hashcat 6.2.4 Benchmark: 7-Zip H/s > Higher Is Better Hashcat 6.2.4 Benchmark: SHA-512 H/s > Higher Is Better Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better LeelaChessZero 0.28 Backend: OpenCL Nodes Per Second > Higher Is Better rocky-gpu-2023-08-17 . 45692 |================================================= LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU M samples/sec > Higher Is Better rocky-gpu-2023-08-17 . 26.02 |================================================= LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU M samples/sec > Higher Is Better rocky-gpu-2023-08-17 . 20.49 |================================================= LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU M samples/sec > Higher Is Better rocky-gpu-2023-08-17 . 20.40 |================================================= LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU M samples/sec > Higher Is Better rocky-gpu-2023-08-17 . 21.26 |================================================= LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU M samples/sec > Higher Is Better rocky-gpu-2023-08-17 . 45.06 |================================================= MandelGPU 1.3pts1 OpenCL Device: GPU Samples/sec > Higher Is Better Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer GIOPS > Higher Is Better rocky-gpu-2023-08-17 . 40183.58 |============================================== Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Integer GIOPS > Higher Is Better rocky-gpu-2023-08-17 . 34761.32 |============================================== Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision GFLOPS > Higher Is Better rocky-gpu-2023-08-17 . 1098.77 |=============================================== Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision GFLOPS > Higher Is Better rocky-gpu-2023-08-17 . 76445.25 |============================================== Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Half Precision GFLOPS > Higher Is Better rocky-gpu-2023-08-17 . 73760.20 |============================================== Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Double Precision GFLOPS > Higher Is Better rocky-gpu-2023-08-17 . 1085.41 |=============================================== Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Single Precision GFLOPS > Higher Is Better rocky-gpu-2023-08-17 . 71978.75 |============================================== NCNN 20230517 Target: Vulkan GPU - Model: mobilenet ms < Lower Is Better rocky-gpu-2023-08-17 . 7.92 |================================================== NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better rocky-gpu-2023-08-17 . 3.11 |================================================== NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better rocky-gpu-2023-08-17 . 3.14 |================================================== NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 ms < Lower Is Better rocky-gpu-2023-08-17 . 3.31 |================================================== NCNN 20230517 Target: Vulkan GPU - Model: mnasnet ms < Lower Is Better rocky-gpu-2023-08-17 . 2.93 |================================================== NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 ms < Lower Is Better rocky-gpu-2023-08-17 . 3.81 |================================================== NCNN 20230517 Target: Vulkan GPU - Model: blazeface ms < Lower Is Better rocky-gpu-2023-08-17 . 1.36 |================================================== NCNN 20230517 Target: Vulkan GPU - Model: googlenet ms < Lower Is Better rocky-gpu-2023-08-17 . 7.72 |================================================== NCNN 20230517 Target: Vulkan GPU - Model: vgg16 ms < Lower Is Better rocky-gpu-2023-08-17 . 20.32 |================================================= NCNN 20230517 Target: Vulkan GPU - Model: resnet18 ms < Lower Is Better rocky-gpu-2023-08-17 . 5.01 |================================================== NCNN 20230517 Target: Vulkan GPU - Model: alexnet ms < Lower Is Better rocky-gpu-2023-08-17 . 3.99 |================================================== NCNN 20230517 Target: Vulkan GPU - Model: resnet50 ms < Lower Is Better rocky-gpu-2023-08-17 . 9.50 |================================================== NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny ms < Lower Is Better rocky-gpu-2023-08-17 . 12.30 |================================================= NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd ms < Lower Is Better rocky-gpu-2023-08-17 . 6.72 |================================================== NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m ms < Lower Is Better rocky-gpu-2023-08-17 . 8.24 |================================================== NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer ms < Lower Is Better rocky-gpu-2023-08-17 . 31.04 |================================================= NCNN 20230517 Target: Vulkan GPU - Model: FastestDet ms < Lower Is Better rocky-gpu-2023-08-17 . 4.01 |================================================== NeatBench 5 Acceleration: GPU FPS > Higher Is Better rocky-gpu-2023-08-17 . 4090 |================================================== PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL Examples Per Second > Higher Is Better RedShift Demo 3.0 Seconds < Lower Is Better Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better rocky-gpu-2023-08-17 . 2.151 |================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D GFLOPS > Higher Is Better rocky-gpu-2023-08-17 . 643.44 |================================================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad GB/s > Higher Is Better rocky-gpu-2023-08-17 . 26.16 |================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP GFLOPS > Higher Is Better rocky-gpu-2023-08-17 . 2789.43 |=============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash GHash/s > Higher Is Better rocky-gpu-2023-08-17 . 93.65 |================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction GB/s > Higher Is Better rocky-gpu-2023-08-17 . 972.75 |================================================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N GFLOPS > Higher Is Better rocky-gpu-2023-08-17 . 26870.9 |=============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops GFLOPS > Higher Is Better rocky-gpu-2023-08-17 . 87267.3 |=============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download GB/s > Higher Is Better rocky-gpu-2023-08-17 . 26.81 |================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback GB/s > Higher Is Better rocky-gpu-2023-08-17 . 26.35 |================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth GB/s > Higher Is Better rocky-gpu-2023-08-17 . 2975.91 |=============================================== ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better rocky-gpu-2023-08-17 . 278 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better rocky-gpu-2023-08-17 . 417 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better rocky-gpu-2023-08-17 . 408 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better rocky-gpu-2023-08-17 . 79.4 |================================================== ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better rocky-gpu-2023-08-17 . 120 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better rocky-gpu-2023-08-17 . 121 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better rocky-gpu-2023-08-17 . 136 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better rocky-gpu-2023-08-17 . 170 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better rocky-gpu-2023-08-17 . 110 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better rocky-gpu-2023-08-17 . 106 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better rocky-gpu-2023-08-17 . 116 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better rocky-gpu-2023-08-17 . 111 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better rocky-gpu-2023-08-17 . 435 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better rocky-gpu-2023-08-17 . 567 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better rocky-gpu-2023-08-17 . 447 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better rocky-gpu-2023-08-17 . 662 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better rocky-gpu-2023-08-17 . 777 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better rocky-gpu-2023-08-17 . 681 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better rocky-gpu-2023-08-17 . 219 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better rocky-gpu-2023-08-17 . 442 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better rocky-gpu-2023-08-17 . 1150 |================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GFLOPs/s > Higher Is Better rocky-gpu-2023-08-17 . 1270 |================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better rocky-gpu-2023-08-17 . 1290 |================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT GFLOPs/s > Higher Is Better rocky-gpu-2023-08-17 . 1340 |==================================================