a AA A: Processor: AMD Ryzen Threadripper PRO 3955WX 16-Cores @ 3.90GHz (16 Cores / 32 Threads), Motherboard: Supermicro M12SWA-TF v1.01 (2.0c BIOS), Chipset: AMD Starship/Matisse, Memory: 512GB, Disk: 1024GB Kioxia KXG80ZNV1T02 + 2 x 8002GB Seagate ST8000NM008A-2KE, Graphics: NVIDIA GeForce RTX 4090 24GB, Audio: AMD Starship/Matisse, Monitor: HP S2231, Network: Intel I210 + Aquantia Device 14c0 OS: Linuxmint 21.1, Kernel: 5.15.0-56-generic (x86_64), Display Server: X Server 1.21.1.3, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.1.68, Vulkan: 1.3.236, Compiler: GCC 11.3.0 + CUDA 12.1, File-System: ext4, Screen Resolution: 1280x1024 Hashcat 6.2.4 Benchmark: MD5 H/s > Higher Is Better A . 306533333333 |============================================================= Hashcat 6.2.4 Benchmark: SHA1 H/s > Higher Is Better A . 99268500000 |============================================================== Hashcat 6.2.4 Benchmark: 7-Zip H/s > Higher Is Better A . 4986933 |================================================================== Hashcat 6.2.4 Benchmark: SHA-512 H/s > Higher Is Better A . 14584666667 |============================================================== Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better A . 3748933 |================================================================== Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer GIOPS > Higher Is Better A . 40146.30 |================================================================= Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Integer GIOPS > Higher Is Better A . 34880.84 |================================================================= Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision GFLOPS > Higher Is Better A . 1093.42 |================================================================== Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision GFLOPS > Higher Is Better A . 76089.21 |================================================================= Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Half Precision GFLOPS > Higher Is Better A . 79528.92 |================================================================= Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Double Precision GFLOPS > Higher Is Better A . 1084.12 |================================================================== Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Single Precision GFLOPS > Higher Is Better A . 73822.68 |================================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better A . 410.8 |==================================================================== cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better A . 889.4 |==================================================================== cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better A . 806.7 |==================================================================== RedShift Demo 3.0 Seconds < Lower Is Better FAHBench 2.3.2 Ns Per Day > Higher Is Better A . 407.49 |=================================================================== clpeak 1.1.2 OpenCL Test: Integer Compute INT GIOPS > Higher Is Better A . 40787.38 |================================================================= clpeak 1.1.2 OpenCL Test: Single-Precision Float GFLOPS > Higher Is Better A . 79667.06 |================================================================= clpeak 1.1.2 OpenCL Test: Double-Precision Double GFLOPS > Higher Is Better A . 1399.57 |================================================================== clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better A . 873.00 |=================================================================== LeelaChessZero 0.28 Backend: OpenCL Nodes Per Second > Higher Is Better A . 17673 |==================================================================== Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better A . 2.129 |==================================================================== ArrayFire 3.7 Test: Conjugate Gradient OpenCL ms < Lower Is Better A . 1.238 |==================================================================== LuxCoreRender 2.6 Scene: DLSC - Acceleration: GPU M samples/sec > Higher Is Better A . 50.89 |==================================================================== LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: GPU M samples/sec > Higher Is Better A . 34.01 |==================================================================== LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: GPU M samples/sec > Higher Is Better A . 37.64 |==================================================================== LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: GPU M samples/sec > Higher Is Better A . 30.89 |==================================================================== LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: GPU M samples/sec > Higher Is Better A . 78.19 |==================================================================== FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better A . 2.976 |==================================================================== ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better A . 110 |====================================================================== ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better A . 167 |====================================================================== ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better A . 150 |====================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better A . 55.2 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better A . 80.5 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better A . 71.7 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better A . 70.3 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better A . 83.6 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better A . 86.5 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better A . 84.4 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better A . 92.1 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better A . 87.5 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better A . 522 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better A . 649 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better A . 437 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better A . 657 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better A . 772 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better A . 667 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better A . 219 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better A . 440 |====================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better A . 1157 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GFLOPs/s > Higher Is Better A . 1283 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better A . 1300 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT GFLOPs/s > Higher Is Better A . 1343 |===================================================================== GROMACS 2023 Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better NCNN 20220729 Target: Vulkan GPU ms < Lower Is Better PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL Examples Per Second > Higher Is Better Blender 3.5 Blend File: BMW27 - Compute: NVIDIA OptiX Seconds < Lower Is Better A . 18.31 |==================================================================== Blender 3.5 Blend File: Classroom - Compute: NVIDIA OptiX Seconds < Lower Is Better A . 4.57 |===================================================================== Blender 3.5 Blend File: Fishy Cat - Compute: NVIDIA OptiX Seconds < Lower Is Better A . 4.69 |===================================================================== Blender 3.5 Blend File: Barbershop - Compute: NVIDIA OptiX Seconds < Lower Is Better A . 20.05 |==================================================================== Blender 3.5 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Seconds < Lower Is Better A . 5.58 |===================================================================== MandelGPU 1.3pts1 OpenCL Device: GPU Samples/sec > Higher Is Better NeatBench 5 Acceleration: GPU FPS > Higher Is Better A . 4090 |=====================================================================