results_05.20.23

2 x AMD EPYC 7282 16-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and NVIDIA RTX A5000 24GB on Debian 11 via the Phoronix Test Suite.

initial:

  Processor: 2 x AMD EPYC 7282 16-Core @ 2.80GHz (32 Cores / 64 Threads), Motherboard: Supermicro H11DSi-NT v2.00 (2.1 BIOS), Chipset: AMD Starship/Matisse, Memory: 3 x 64 GB DDR4-2667MT/s 72ASS8G72LZ-2G6D2, Disk: Samsung SSD 970 EVO Plus 500GB + 2 x 4001GB SanDisk SDSSDH3 + 2 x 2000GB Samsung SSD 860 + 16001GB Seagate ST16000NM001G-2K + 1000GB Western Digital WDBNCE0010P + 120GB BIWIN SSD, Graphics: NVIDIA RTX A5000 24GB, Audio: NVIDIA GA102 HD Audio, Network: 2 x Intel 10G X550T

  OS: Debian 11, Kernel: 5.15.107-2-pve (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.1.68, Vulkan: 1.3.236, Compiler: GCC 10.2.1 20210110 + CUDA 12.1, File-System: ext4, Screen Resolution: 1024x768

Hashcat 6.2.4
Benchmark: MD5
H/s > Higher Is Better
initial . 48256933333 |========================================================

Hashcat 6.2.4
Benchmark: SHA1
H/s > Higher Is Better
initial . 15821766667 |========================================================

Hashcat 6.2.4
Benchmark: 7-Zip
H/s > Higher Is Better
initial . 778867 |=============================================================

Hashcat 6.2.4
Benchmark: SHA-512
H/s > Higher Is Better
initial . 2255533333 |=========================================================

Hashcat 6.2.4
Benchmark: TrueCrypt RIPEMD160 + XTS
H/s > Higher Is Better
initial . 573667 |=============================================================

Mixbench 2020-06-23
Backend: OpenCL - Benchmark: Integer
GIOPS > Higher Is Better
initial . 15426.76 |===========================================================

Mixbench 2020-06-23
Backend: NVIDIA CUDA - Benchmark: Integer
GIOPS > Higher Is Better
initial . 13380.31 |===========================================================

Mixbench 2020-06-23
Backend: OpenCL - Benchmark: Double Precision
GFLOPS > Higher Is Better
initial . 410.5 |==============================================================

Mixbench 2020-06-23
Backend: OpenCL - Benchmark: Single Precision
GFLOPS > Higher Is Better
initial . 29285.69 |===========================================================

Mixbench 2020-06-23
Backend: NVIDIA CUDA - Benchmark: Half Precision
GFLOPS > Higher Is Better
initial . 30554.07 |===========================================================

Mixbench 2020-06-23
Backend: NVIDIA CUDA - Benchmark: Double Precision
GFLOPS > Higher Is Better
initial . 408.85 |=============================================================

Mixbench 2020-06-23
Backend: NVIDIA CUDA - Benchmark: Single Precision
GFLOPS > Higher Is Better
initial . 27232.05 |===========================================================

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: S3D

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Reduction

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: GEMM SGEMM_N

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Texture Read Bandwidth

cl-mem 2017-01-13
Benchmark: Copy
GB/s > Higher Is Better
initial . 338.9 |==============================================================

cl-mem 2017-01-13
Benchmark: Read
GB/s > Higher Is Better
initial . 662.9 |==============================================================

cl-mem 2017-01-13
Benchmark: Write
GB/s > Higher Is Better
initial . 657.3 |==============================================================

RedShift Demo 3.0
Seconds < Lower Is Better

FAHBench 2.3.2
Ns Per Day > Higher Is Better
initial . 268.79 |=============================================================

clpeak 1.1.2
OpenCL Test: Integer Compute INT
GIOPS > Higher Is Better
initial . 13665.69 |===========================================================

clpeak 1.1.2
OpenCL Test: Single-Precision Float
GFLOPS > Higher Is Better
initial . 26593.20 |===========================================================

clpeak 1.1.2
OpenCL Test: Double-Precision Double
GFLOPS > Higher Is Better
initial . 487.73 |=============================================================

clpeak 1.1.2
OpenCL Test: Global Memory Bandwidth
GBPS > Higher Is Better
initial . 658.31 |=============================================================

LeelaChessZero 0.28
Backend: OpenCL
Nodes Per Second > Higher Is Better

Rodinia 3.1
Test: OpenCL Particle Filter
Seconds < Lower Is Better
initial . 4.905 |==============================================================

ArrayFire 3.7
Test: Conjugate Gradient OpenCL
ms < Lower Is Better
initial . 1.747 |==============================================================

LuxCoreRender 2.6
Scene: DLSC - Acceleration: GPU
M samples/sec > Higher Is Better
initial . 11.16 |==============================================================

LuxCoreRender 2.6
Scene: Danish Mood - Acceleration: GPU
M samples/sec > Higher Is Better
initial . 7.33 |===============================================================

LuxCoreRender 2.6
Scene: Orange Juice - Acceleration: GPU
M samples/sec > Higher Is Better
initial . 9.89 |===============================================================

LuxCoreRender 2.6
Scene: LuxCore Benchmark - Acceleration: GPU
M samples/sec > Higher Is Better
initial . 9.23 |===============================================================

LuxCoreRender 2.6
Scene: Rainbow Colors and Prism - Acceleration: GPU
M samples/sec > Higher Is Better
initial . 29.62 |==============================================================

FinanceBench 2016-07-25
Benchmark: Black-Scholes OpenCL
ms < Lower Is Better
initial . 7.638 |==============================================================

ViennaCL 1.7.1
Test: CPU BLAS - sCOPY
GB/s > Higher Is Better
initial . 948 |================================================================

ViennaCL 1.7.1
Test: CPU BLAS - sAXPY
GB/s > Higher Is Better
initial . 1227 |===============================================================

ViennaCL 1.7.1
Test: CPU BLAS - sDOT
GB/s > Higher Is Better
initial . 408 |================================================================

ViennaCL 1.7.1
Test: CPU BLAS - dCOPY
GB/s > Higher Is Better
initial . 44.9 |===============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dAXPY
GB/s > Higher Is Better
initial . 67.9 |===============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dDOT
GB/s > Higher Is Better
initial . 64.9 |===============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMV-N
GB/s > Higher Is Better
initial . 70.7 |===============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMV-T
GB/s > Higher Is Better
initial . 279 |================================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-NN
GFLOPs/s > Higher Is Better
initial . 74.4 |===============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-NT
GFLOPs/s > Higher Is Better
initial . 73.2 |===============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-TN
GFLOPs/s > Higher Is Better
initial . 87.9 |===============================================================

ViennaCL 1.7.1
Test: CPU BLAS - dGEMM-TT
GFLOPs/s > Higher Is Better
initial . 79.7 |===============================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - sCOPY
GB/s > Higher Is Better
initial . 331 |================================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - sAXPY
GB/s > Higher Is Better
initial . 449 |================================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dCOPY
GB/s > Higher Is Better
initial . 532 |================================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dAXPY
GB/s > Higher Is Better
initial . 616 |================================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dDOT
GB/s > Higher Is Better
initial . 567 |================================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMV-N
GB/s > Higher Is Better
initial . 173 |================================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMV-T
GB/s > Higher Is Better
initial . 345 |================================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-NN
GFLOPs/s > Higher Is Better
initial . 448 |================================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - dGEMM-NT
GFLOPs/s > Higher Is Better
initial . 449 |================================================================

ViennaCL 1.7.1
Test: OpenCL BLAS - sDOT
GB/s > Higher Is Better
initial . 332 |================================================================

GROMACS 2023
Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare
Ns Per Day > Higher Is Better

Caffe 2020-02-13
Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100
Milli-Seconds < Lower Is Better
initial . 820.58 |=============================================================

Caffe 2020-02-13
Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200
Milli-Seconds < Lower Is Better
initial . 1629.07 |============================================================

Caffe 2020-02-13
Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000
Milli-Seconds < Lower Is Better
initial . 8138.44 |============================================================

Caffe 2020-02-13
Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100
Milli-Seconds < Lower Is Better
initial . 3063.99 |============================================================

Caffe 2020-02-13
Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200
Milli-Seconds < Lower Is Better
initial . 6105.65 |============================================================

Caffe 2020-02-13
Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000
Milli-Seconds < Lower Is Better
initial . 30140.6 |============================================================

NCNN 20220729
Target: Vulkan GPU
ms < Lower Is Better

PlaidML
FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better

PlaidML
FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL
Examples Per Second > Higher Is Better

PlaidML
FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better

PlaidML
FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL
Examples Per Second > Higher Is Better

PlaidML
FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL
Examples Per Second > Higher Is Better

Blender 3.5
Blend File: BMW27 - Compute: NVIDIA OptiX
Seconds < Lower Is Better
initial . 7.57 |===============================================================

Blender 3.5
Blend File: Classroom - Compute: NVIDIA OptiX
Seconds < Lower Is Better
initial . 17.60 |==============================================================

Blender 3.5
Blend File: Fishy Cat - Compute: NVIDIA OptiX
Seconds < Lower Is Better
initial . 14.63 |==============================================================

Blender 3.5
Blend File: Barbershop - Compute: NVIDIA OptiX
Seconds < Lower Is Better
initial . 66.33 |==============================================================

Blender 3.5
Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX
Seconds < Lower Is Better
initial . 19.64 |==============================================================

MandelGPU 1.3pts1
OpenCL Device: GPU
Samples/sec > Higher Is Better

NeatBench 5
Acceleration: GPU
FPS > Higher Is Better
initial . 55.1 |===============================================================