ptsopenclbenchmark Intel Core i9-13980HX testing with a MSI MS-17S1 (E17S1IMS.30D BIOS) and MSI Intel RPL-S 16GB on Ubuntu 22.04 via the Phoronix Test Suite. pts_opencl_benchmark: Processor: Intel Core i9-13980HX @ 5.60GHz (24 Cores / 32 Threads), Motherboard: MSI MS-17S1 (E17S1IMS.30D BIOS), Chipset: Intel Device 7a27, Memory: 32GB, Disk: 2048GB Micron_3400_MTFDKBA2T0TFH, Graphics: MSI Intel RPL-S 16GB (210/405MHz), Audio: Intel Device 7a50, Monitor: PA248, Network: Realtek RTL8125 2.5GbE + Intel Device 7a70 OS: Ubuntu 22.04, Kernel: 6.5.0-26-generic (x86_64), Desktop: Cinnamon 5.2.7, Display Server: X Server 1.21.1.4, Display Driver: NVIDIA 535.161.07, OpenGL: 4.6 Mesa 23.2.1-1ubuntu3.1~22.04.2, OpenCL: OpenCL 3.0 CUDA 12.2.148, Vulkan: 1.3.255, Compiler: GCC 11.4.0 + CUDA 12.2, File-System: ext4, Screen Resolution: 1920x1200 SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D GFLOPS > Higher Is Better pts_opencl_benchmark . 367.73 |================================================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad GB/s > Higher Is Better pts_opencl_benchmark . 12.78 |================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP GFLOPS > Higher Is Better pts_opencl_benchmark . 1539.83 |=============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash GHash/s > Higher Is Better pts_opencl_benchmark . 45.74 |================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction GB/s > Higher Is Better pts_opencl_benchmark . 541.91 |================================================ SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N GFLOPS > Higher Is Better pts_opencl_benchmark . 13385.5 |=============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops GFLOPS > Higher Is Better pts_opencl_benchmark . 44825.1 |=============================================== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download GB/s > Higher Is Better pts_opencl_benchmark . 12.92 |================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback GB/s > Higher Is Better pts_opencl_benchmark . 13.19 |================================================= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth GB/s > Higher Is Better pts_opencl_benchmark . 2491.20 |=============================================== cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better pts_opencl_benchmark . 345.7 |================================================= cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better pts_opencl_benchmark . 521.1 |================================================= cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better pts_opencl_benchmark . 514.8 |================================================= FluidX3D 2.9 Test: FP32-FP32 MLUPs/s > Higher Is Better pts_opencl_benchmark . 3357 |================================================== FluidX3D 2.9 Test: FP32-FP16C MLUPs/s > Higher Is Better pts_opencl_benchmark . 6880 |================================================== FluidX3D 2.9 Test: FP32-FP16S MLUPs/s > Higher Is Better pts_opencl_benchmark . 6780 |================================================== clpeak 1.1.2 OpenCL Test: Kernel Latency us < Lower Is Better pts_opencl_benchmark . 3.66 |================================================== clpeak 1.1.2 OpenCL Test: Integer Compute GIOPS > Higher Is Better pts_opencl_benchmark . 19644.88 |============================================== clpeak 1.1.2 OpenCL Test: Integer 24-bit Compute GIOPS > Higher Is Better pts_opencl_benchmark . 19674.45 |============================================== clpeak 1.1.2 OpenCL Test: Global Memory Bandwidth GBPS > Higher Is Better pts_opencl_benchmark . 518.27 |================================================ clpeak 1.1.2 OpenCL Test: Double-Precision Compute GFLOPS > Higher Is Better pts_opencl_benchmark . 702.62 |================================================ clpeak 1.1.2 OpenCL Test: Single-Precision Compute GFLOPS > Higher Is Better pts_opencl_benchmark . 38349.93 |============================================== clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueReadBuffer GBPS > Higher Is Better pts_opencl_benchmark . 12.86 |================================================= clpeak 1.1.2 OpenCL Test: Transfer Bandwidth enqueueWriteBuffer GBPS > Higher Is Better pts_opencl_benchmark . 12.00 |================================================= Parboil 2.5 Test: OpenCL BFS Seconds < Lower Is Better Parboil 2.5 Test: OpenCL LBM Seconds < Lower Is Better Parboil 2.5 Test: OpenCL Histo Seconds < Lower Is Better Parboil 2.5 Test: OpenCL TPACF Seconds < Lower Is Better Parboil 2.5 Test: OpenCL MRI Gridding Seconds < Lower Is Better Rodinia 3.1 Test: OpenCL LavaMD Seconds < Lower Is Better Rodinia 3.1 Test: OpenCL Myocyte Seconds < Lower Is Better pts_opencl_benchmark . 21.45 |================================================= Rodinia 3.1 Test: OpenCL Heartwall Seconds < Lower Is Better Rodinia 3.1 Test: OpenCL Leukocyte Seconds < Lower Is Better pts_opencl_benchmark . 2.670 |================================================= Rodinia 3.1 Test: OpenCL Particle Filter Seconds < Lower Is Better pts_opencl_benchmark . 3.296 |================================================= ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GB/s > Higher Is Better pts_opencl_benchmark . 112 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GB/s > Higher Is Better pts_opencl_benchmark . 135 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - sDOT GB/s > Higher Is Better pts_opencl_benchmark . 139 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GB/s > Higher Is Better pts_opencl_benchmark . 59.2 |================================================== ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GB/s > Higher Is Better pts_opencl_benchmark . 73.4 |================================================== ViennaCL 1.7.1 Test: CPU BLAS - dDOT GB/s > Higher Is Better pts_opencl_benchmark . 83.5 |================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GB/s > Higher Is Better pts_opencl_benchmark . 84.4 |================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GB/s > Higher Is Better pts_opencl_benchmark . 92.5 |================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GFLOPs/s > Higher Is Better pts_opencl_benchmark . 100.7 |================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GFLOPs/s > Higher Is Better pts_opencl_benchmark . 102.8 |================================================= ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GFLOPs/s > Higher Is Better pts_opencl_benchmark . 108 |=================================================== ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GFLOPs/s > Higher Is Better pts_opencl_benchmark . 112 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sCOPY GB/s > Higher Is Better pts_opencl_benchmark . 348 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sAXPY GB/s > Higher Is Better pts_opencl_benchmark . 441 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - sDOT GB/s > Higher Is Better pts_opencl_benchmark . 380 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dCOPY GB/s > Higher Is Better pts_opencl_benchmark . 475 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dAXPY GB/s > Higher Is Better pts_opencl_benchmark . 522 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dDOT GB/s > Higher Is Better pts_opencl_benchmark . 523 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-N GB/s > Higher Is Better pts_opencl_benchmark . 197 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMV-T GB/s > Higher Is Better pts_opencl_benchmark . 389 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NN GFLOPs/s > Higher Is Better pts_opencl_benchmark . 627 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-NT GFLOPs/s > Higher Is Better pts_opencl_benchmark . 644 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TN GFLOPs/s > Higher Is Better pts_opencl_benchmark . 670 |=================================================== ViennaCL 1.7.1 Test: OpenCL BLAS - dGEMM-TT GFLOPs/s > Higher Is Better pts_opencl_benchmark . 687 |=================================================== Darktable 3.8.1 Test: Boat - Acceleration: OpenCL Seconds < Lower Is Better pts_opencl_benchmark . 1.280 |================================================= Darktable 3.8.1 Test: Masskrug - Acceleration: OpenCL Seconds < Lower Is Better pts_opencl_benchmark . 1.699 |================================================= Darktable 3.8.1 Test: Server Rack - Acceleration: OpenCL Seconds < Lower Is Better pts_opencl_benchmark . 0.119 |================================================= Darktable 3.8.1 Test: Server Room - Acceleration: OpenCL Seconds < Lower Is Better pts_opencl_benchmark . 0.687 |================================================= Xsbench OpenCL 2017-07-06 Lookups/s > Higher Is Better JuliaGPU 1.2pts1 OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT Samples/sec > Higher Is Better MandelbulbGPU 1.0pts1 OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT Samples/sec > Higher Is Better MandelGPU 1.3pts1 OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT Samples/sec > Higher Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Caustic Samples/sec > Higher Is Better pts_opencl_benchmark . 1711642497 |============================================ SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Cornell Samples/sec > Higher Is Better pts_opencl_benchmark . 1711642633 |============================================ SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Caustic3 Samples/sec > Higher Is Better pts_opencl_benchmark . 1711642770 |============================================ LuxMark 3.1 OpenCL Device: CPU - Scene: Hotel Score > Higher Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Hotel Score > Higher Is Better pts_opencl_benchmark . 17792 |================================================= LuxMark 3.1 OpenCL Device: CPU+GPU - Scene: Hotel Score > Higher Is Better pts_opencl_benchmark . 17792 |================================================= LuxMark 3.1 OpenCL Device: CPU - Scene: Microphone Score > Higher Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Microphone Score > Higher Is Better pts_opencl_benchmark . 55201 |================================================= LuxMark 3.1 OpenCL Device: CPU - Scene: Luxball HDR Score > Higher Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Luxball HDR Score > Higher Is Better pts_opencl_benchmark . 82081 |================================================= LuxMark 3.1 OpenCL Device: Hybrid GPU - Scene: Hotel Score > Higher Is Better LuxMark 3.1 OpenCL Device: CPU+GPU - Scene: Microphone Score > Higher Is Better pts_opencl_benchmark . 55213 |================================================= LuxMark 3.1 OpenCL Device: CPU+GPU - Scene: Luxball HDR Score > Higher Is Better pts_opencl_benchmark . 82042 |================================================= LuxMark 3.1 OpenCL Device: Hybrid GPU - Scene: Microphone Score > Higher Is Better LuxMark 3.1 OpenCL Device: Hybrid GPU - Scene: Luxball HDR Score > Higher Is Better OpenDwarfs 2013-11-06 Test: LU Decomposition ms < Lower Is Better OpenDwarfs 2013-11-06 Test: Compressed Sparse Row ms < Lower Is Better OpenDwarfs 2013-11-06 Test: Cyclic Redundancy Check ms < Lower Is Better Lulesh OpenCL 2017-07-06 z/s > Higher Is Better pts_opencl_benchmark . 8852.80 |===============================================