y y y: Processor: Intel Core i7-13700 @ 5.10GHz (16 Cores / 24 Threads), Motherboard: HP 895C (U50 Ver. 02.04.02 BIOS), Chipset: Intel Device 7aa7, Memory: 2 x 16 GB DDR5-4400MT/s Samsung M323R2GA3BB0-CQKOL, Disk: 1024GB MTFDKBA1T0TFK-1BC1AABHA + 512GB MTFDKBA512TFK-1BC1AABHA, Graphics: NVIDIA T400 4GB, Audio: Realtek ALC222, Monitor: HP Z24i G2, Network: Intel + Intel Device 7af0 OS: Linuxmint 21.2, Kernel: 6.1.0-1033-oem (x86_64), Desktop: Cinnamon 5.8.4, Display Server: X Server 1.21.1.4, Display Driver: NVIDIA 535.154.05, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.148, Vulkan: 1.3.242, Compiler: GCC 11.4.0 + CUDA 11.5, File-System: ext4, Screen Resolution: 1920x1200 ArrayFire 3.9 Test: Conjugate Gradient OpenCL Betsy GPU Compressor 1.1 Beta Codec: ETC1 - Quality: Highest Seconds < Lower Is Better Betsy GPU Compressor 1.1 Beta Codec: ETC2 RGB - Quality: Highest Seconds < Lower Is Better Blender 4.0 Blend File: BMW27 - Compute: NVIDIA OptiX Seconds < Lower Is Better y . 158.28 |=================================================================== Blender 4.0 Blend File: Classroom - Compute: NVIDIA OptiX Seconds < Lower Is Better y . 362.28 |=================================================================== Blender 4.0 Blend File: Fishy Cat - Compute: NVIDIA OptiX Seconds < Lower Is Better y . 340.74 |=================================================================== Blender 4.0 Blend File: Barbershop - Compute: NVIDIA OptiX Seconds < Lower Is Better y . 1379.69 |================================================================== Blender 4.0 Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX Seconds < Lower Is Better y . 836.36 |=================================================================== Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200 Milli-Seconds < Lower Is Better Caffe 2020-02-13 Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000 Milli-Seconds < Lower Is Better cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better y . 61.9 |===================================================================== cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better y . 71.2 |===================================================================== cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better y . 65.7 |===================================================================== clpeak 1.1.2 OpenCL Test: Integer Compute INT GIOPS > Higher Is Better y . 515.21 |=================================================================== clpeak 1.1.2 OpenCL Test: Single-Precision Float GFLOPS > Higher Is Better y . 789.45 |=================================================================== clpeak 1.1.2 OpenCL Test: Double-Precision Double GFLOPS > Higher Is Better y . 44.57 |==================================================================== clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better y . 67.03 |==================================================================== Compile Bench 0.6 Test: Compile MB/s > Higher Is Better y . 2595.20 |================================================================== Compile Bench 0.6 Test: Initial Create MB/s > Higher Is Better y . 699.13 |=================================================================== Compile Bench 0.6 Test: Read Compiled Tree MB/s > Higher Is Better y . 1465.81 |================================================================== Dbench 4.0 12 Clients MB/s > Higher Is Better y . 687.68 |=================================================================== Dbench 4.0 1 Clients MB/s > Higher Is Better y . 170.98 |=================================================================== FinanceBench 2016-07-25 Benchmark: Black-Scholes OpenCL ms < Lower Is Better y . 240.70 |=================================================================== Flexible IO Tester 3.36 Type: Random Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 0.665 |==================================================================== Flexible IO Tester 3.36 Type: Random Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory IOPS > Higher Is Better y . 680889 |=================================================================== Flexible IO Tester 3.36 Type: Random Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 0.690 |==================================================================== Flexible IO Tester 3.36 Type: Random Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory IOPS > Higher Is Better y . 706333 |=================================================================== Flexible IO Tester 3.36 Type: Random Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 0.831 |==================================================================== Flexible IO Tester 3.36 Type: Random Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory IOPS > Higher Is Better y . 850933 |=================================================================== Flexible IO Tester 3.36 Type: Random Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 0.817 |==================================================================== Flexible IO Tester 3.36 Type: Random Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory IOPS > Higher Is Better y . 836533 |=================================================================== Flexible IO Tester 3.36 Type: Sequential Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1.235 |==================================================================== Flexible IO Tester 3.36 Type: Sequential Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory IOPS > Higher Is Better y . 1264778 |================================================================== Flexible IO Tester 3.36 Type: Sequential Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1.216 |==================================================================== Flexible IO Tester 3.36 Type: Sequential Read - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory IOPS > Higher Is Better y . 1245000 |================================================================== Flexible IO Tester 3.36 Type: Sequential Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1.100 |==================================================================== Flexible IO Tester 3.36 Type: Sequential Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory IOPS > Higher Is Better y . 1126067 |================================================================== Flexible IO Tester 3.36 Type: Sequential Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1.069 |==================================================================== Flexible IO Tester 3.36 Type: Sequential Write - IO Engine: Linux AIO - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory IOPS > Higher Is Better y . 1095000 |================================================================== FS-Mark 3.3 Test: 1000 Files, 1MB Size Files/s > Higher Is Better y . 261.7 |==================================================================== FS-Mark 3.3 Test: 5000 Files, 1MB Size, 4 Threads Files/s > Higher Is Better y . 690.2 |==================================================================== FS-Mark 3.3 Test: 4000 Files, 32 Sub Dirs, 1MB Size Files/s > Higher Is Better y . 245.5 |==================================================================== FS-Mark 3.3 Test: 1000 Files, 1MB Size, No Sync/FSync Files/s > Higher Is Better y . 2426.6 |=================================================================== GROMACS 2023 Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better Hashcat 6.2.4 Benchmark: MD5 H/s > Higher Is Better y . 4220333333 |=============================================================== Hashcat 6.2.4 Benchmark: SHA1 H/s > Higher Is Better y . 1418466667 |=============================================================== Hashcat 6.2.4 Benchmark: 7-Zip H/s > Higher Is Better Hashcat 6.2.4 Benchmark: SHA-512 H/s > Higher Is Better y . 201533333 |================================================================ Hashcat 6.2.4 Benchmark: TrueCrypt RIPEMD160 + XTS H/s > Higher Is Better IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Bedroom M samples/s > Higher Is Better y . 1.323 |==================================================================== IndigoBench 4.4 Acceleration: OpenCL GPU - Scene: Supercar M samples/s > Higher Is Better y . 4.120 |==================================================================== IOR 3.3.0 Block Size: 2MB - Disk Target: / MB/s > Higher Is Better y . 1207.29 |================================================================== IOR 3.3.0 Block Size: 4MB - Disk Target: / MB/s > Higher Is Better y . 1219.11 |================================================================== IOR 3.3.0 Block Size: 8MB - Disk Target: / MB/s > Higher Is Better y . 1267.12 |================================================================== IOR 3.3.0 Block Size: 16MB - Disk Target: / MB/s > Higher Is Better y . 1340.16 |================================================================== IOR 3.3.0 Block Size: 32MB - Disk Target: / MB/s > Higher Is Better y . 1457.85 |================================================================== IOR 3.3.0 Block Size: 64MB - Disk Target: / MB/s > Higher Is Better y . 1415.71 |================================================================== IOR 3.3.0 Block Size: 256MB - Disk Target: / MB/s > Higher Is Better y . 1454.17 |================================================================== IOR 3.3.0 Block Size: 512MB - Disk Target: / MB/s > Higher Is Better y . 1703.98 |================================================================== IOR 3.3.0 Block Size: 1024MB - Disk Target: / MB/s > Higher Is Better y . 1902.18 |================================================================== IOR 3.3.0 Block Size: 2MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1236.65 |================================================================== IOR 3.3.0 Block Size: 4MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1250.62 |================================================================== IOR 3.3.0 Block Size: 8MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1267.36 |================================================================== IOR 3.3.0 Block Size: 16MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1352.41 |================================================================== IOR 3.3.0 Block Size: 32MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1430.32 |================================================================== IOR 3.3.0 Block Size: 64MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1401.16 |================================================================== IOR 3.3.0 Block Size: 256MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1448.84 |================================================================== IOR 3.3.0 Block Size: 512MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1697.55 |================================================================== IOR 3.3.0 Block Size: 1024MB - Disk Target: Default Test Directory MB/s > Higher Is Better y . 1896.13 |================================================================== IOzone 3.465 8GB Write Performance MB/s > Higher Is Better LeelaChessZero 0.30 Backend: OpenCL Nodes Per Second > Higher Is Better Libplacebo 5.229.1 FPS > Higher Is Better MandelGPU 1.3pts1 OpenCL Device: GPU Samples/sec > Higher Is Better y . 53579848.2 |=============================================================== Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Integer Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Integer Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Double Precision Mixbench 2020-06-23 Backend: OpenCL - Benchmark: Single Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Half Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Double Precision Mixbench 2020-06-23 Backend: NVIDIA CUDA - Benchmark: Single Precision NAMD CUDA 2.14 ATPase Simulation - 327,506 Atoms days/ns < Lower Is Better y . 0.83559 |================================================================== NeatBench 5 Acceleration: GPU FPS > Higher Is Better y . 7.75 |===================================================================== PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: Yes - Mode: Inference - Network: Mobilenet - Device: OpenCL Examples Per Second > Higher Is Better PlaidML FP16: No - Mode: Inference - Network: DenseNet 201 - Device: OpenCL Examples Per Second > Higher Is Better PostMark 1.51 Disk Transaction Performance TPS > Higher Is Better y . 6919 |===================================================================== RealSR-NCNN 20200818 Scale: 4x - TAA: No Seconds < Lower Is Better y . 63.92 |==================================================================== RealSR-NCNN 20200818 Scale: 4x - TAA: Yes Seconds < Lower Is Better y . 492.19 |=================================================================== RedShift Demo 3.0 Seconds < Lower Is Better Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better y . 42.27 |==================================================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth SQLite 3.41.2 Threads / Copies: 1 Seconds < Lower Is Better y . 31.68 |==================================================================== SQLite 3.41.2 Threads / Copies: 2 Seconds < Lower Is Better y . 61.19 |==================================================================== SQLite 3.41.2 Threads / Copies: 4 Seconds < Lower Is Better y . 81.18 |==================================================================== SQLite 3.41.2 Threads / Copies: 8 Seconds < Lower Is Better y . 120.91 |=================================================================== SQLite 3.41.2 Threads / Copies: 16 Seconds < Lower Is Better y . 182.75 |=================================================================== SQLite 3.41.2 Threads / Copies: 24 Seconds < Lower Is Better y . 240.93 |=================================================================== ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better y . 76.3 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better y . 88.9 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better y . 92.6 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better y . 51.7 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better y . 55.0 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better y . 64.9 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better y . 68.6 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better y . 71.8 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better y . 58.3 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better y . 57.9 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better y . 62.5 |===================================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better y . 61.1 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better y . 53.8 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better y . 65.3 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better y . 68.3 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better y . 68.7 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better y . 70.5 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better y . 76.0 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better y . 76.1 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better y . 71.1 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better y . 43.7 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GFLOPs/s > Higher Is Better y . 44.1 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better y . 43.5 |===================================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT GFLOPs/s > Higher Is Better y . 43.7 |===================================================================== VkFFT 1.2.31 Test: FFT + iFFT R2C / C2R Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C 1D batched in half precision Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C Bluestein in single precision Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C 1D batched in double precision Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C 1D batched in single precision Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C multidimensional in single precision Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C Bluestein benchmark in double precision Benchmark Score > Higher Is Better VkFFT 1.2.31 Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling Benchmark Score > Higher Is Better vkpeak 20230730 fp32-scalar GFLOPS > Higher Is Better y . 1327.83 |================================================================== vkpeak 20230730 fp32-vec4 GFLOPS > Higher Is Better y . 1319.34 |================================================================== vkpeak 20230730 fp16-scalar GFLOPS > Higher Is Better y . 1249.31 |================================================================== vkpeak 20230730 fp16-vec4 GFLOPS > Higher Is Better y . 2464.89 |================================================================== vkpeak 20230730 fp64-scalar GFLOPS > Higher Is Better y . 44.50 |==================================================================== vkpeak 20230730 fp64-vec4 GFLOPS > Higher Is Better y . 44.5 |===================================================================== vkpeak 20230730 int32-scalar GIOPS > Higher Is Better y . 1306.04 |================================================================== vkpeak 20230730 int32-vec4 GIOPS > Higher Is Better y . 1297.44 |================================================================== vkpeak 20230730 int16-scalar GIOPS > Higher Is Better y . 825.87 |=================================================================== vkpeak 20230730 int16-vec4 GIOPS > Higher Is Better y . 1028.69 |================================================================== VkResample 1.0 Upscale: 2x - Precision: Double ms < Lower Is Better y . 500.01 |=================================================================== VkResample 1.0 Upscale: 2x - Precision: Single ms < Lower Is Better y . 151.40 |=================================================================== Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: No Seconds < Lower Is Better y . 4.415 |==================================================================== Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes Seconds < Lower Is Better y . 24.67 |====================================================================