Intel Core i9-13980HX testing with a MSI MS-17S1 (E17S1IMS.30D BIOS) and MSI Intel RPL-S 16GB on Ubuntu 22.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2403285-NE-PTSOPENCL91
HTML result view exported from: https://openbenchmarking.org/result/2403285-NE-PTSOPENCL91
ptsopenclbenchmark: system details for pts_opencl_benchmark
Processor: Intel Core i9-13980HX @ 5.60GHz (24 Cores / 32 Threads)
Motherboard: MSI MS-17S1 (E17S1IMS.30D BIOS)
Chipset: Intel Device 7a27
Memory: 32GB
Disk: 2048GB Micron_3400_MTFDKBA2T0TFH
Graphics: MSI Intel RPL-S 16GB (210/405MHz)
Audio: Intel Device 7a50
Monitor: PA248
Network: Realtek RTL8125 2.5GbE + Intel Device 7a70
OS: Ubuntu 22.04
Kernel: 6.5.0-26-generic (x86_64)
Desktop: Cinnamon 5.2.7
Display Server: X Server 1.21.1.4
Display Driver: NVIDIA 535.161.07
OpenGL: 4.6 Mesa 23.2.1-1ubuntu3.1~22.04.2
OpenCL: OpenCL 3.0 CUDA 12.2.148
Vulkan: 1.3.255
Compiler: GCC 11.4.0 + CUDA 12.2
File-System: ext4
Screen Resolution: 1920x1200
System notes:
- Transparent Huge Pages: madvise
- GCC configure options: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: intel_pstate powersave (EPP: balance_performance)
- CPU Microcode: 0x11d
- Thermald 2.4.9
- BAR1 / Visible vRAM Size: 16384 MiB
- vBIOS Version: 95.03.2a.00.2a
- GPU Compute Cores: 9728
- Python 3.10.12
- gather_data_sampling: Not affected
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence
- srbds: Not affected
- tsx_async_abort: Not affected
ptsopenclbenchmark: result overview
Test                                            pts_opencl_benchmark
shoc: OpenCL - S3D                              367.734
shoc: OpenCL - Triad                            12.7811
shoc: OpenCL - FFT SP                           1539.83
shoc: OpenCL - MD5 Hash                         45.7399
shoc: OpenCL - Reduction                        541.910
shoc: OpenCL - GEMM SGEMM_N                     13385.5
shoc: OpenCL - Max SP Flops                     44825.1
shoc: OpenCL - Bus Speed Download               12.9214
shoc: OpenCL - Bus Speed Readback               13.1919
shoc: OpenCL - Texture Read Bandwidth           2491.20
cl-mem: Copy                                    345.7
cl-mem: Read                                    521.1
cl-mem: Write                                   514.8
fluidx3d: FP32-FP32                             3357
fluidx3d: FP32-FP16C                            6880
fluidx3d: FP32-FP16S                            6780
clpeak: Kernel Latency                          3.66
clpeak: Integer Compute                         19644.88
clpeak: Integer 24-bit Compute                  19674.45
clpeak: Global Memory Bandwidth                 518.27
clpeak: Double-Precision Compute                702.62
clpeak: Single-Precision Compute                38349.93
clpeak: Transfer Bandwidth enqueueReadBuffer    12.86
clpeak: Transfer Bandwidth enqueueWriteBuffer   12.00
rodinia: OpenCL Myocyte                         21.446
rodinia: OpenCL Leukocyte                       2.670
rodinia: OpenCL Particle Filter                 3.296
viennacl: CPU BLAS - sCOPY                      112
viennacl: CPU BLAS - sAXPY                      135
viennacl: CPU BLAS - sDOT                       139
viennacl: CPU BLAS - dCOPY                      59.2
viennacl: CPU BLAS - dAXPY                      73.4
viennacl: CPU BLAS - dDOT                       83.5
viennacl: CPU BLAS - dGEMV-N                    84.4
viennacl: CPU BLAS - dGEMV-T                    92.5
viennacl: CPU BLAS - dGEMM-NN                   100.7
viennacl: CPU BLAS - dGEMM-NT                   102.8
viennacl: CPU BLAS - dGEMM-TT                   108
viennacl: CPU BLAS - dGEMM-TN                   112
viennacl: OpenCL BLAS - sCOPY                   348
viennacl: OpenCL BLAS - sAXPY                   441
viennacl: OpenCL BLAS - sDOT                    380
viennacl: OpenCL BLAS - dCOPY                   475
viennacl: OpenCL BLAS - dAXPY                   522
viennacl: OpenCL BLAS - dDOT                    523
viennacl: OpenCL BLAS - dGEMV-N                 197
viennacl: OpenCL BLAS - dGEMV-T                 389
viennacl: OpenCL BLAS - dGEMM-NN                627
viennacl: OpenCL BLAS - dGEMM-NT                644
viennacl: OpenCL BLAS - dGEMM-TN                670
viennacl: OpenCL BLAS - dGEMM-TT                687
darktable: Boat - OpenCL                        1.280
darktable: Masskrug - OpenCL                    1.699
darktable: Server Rack - OpenCL                 0.119
darktable: Server Room - OpenCL                 0.687
smallpt-gpu: GPU - 1920 x 1200 - Caustic        1711642497
smallpt-gpu: GPU - 1920 x 1200 - Cornell        1711642633
smallpt-gpu: GPU - 1920 x 1200 - Caustic3       1711642770
luxmark: GPU - Hotel                            17792
luxmark: CPU+GPU - Hotel                        17792
luxmark: GPU - Microphone                       55201
luxmark: GPU - Luxball HDR                      82081
luxmark: CPU+GPU - Microphone                   55213
luxmark: CPU+GPU - Luxball HDR                  82042
lulesh-cl:                                      8852.8010
SHOC Scalable HeterOgeneous Computing 2020-04-17, Target: OpenCL (pts_opencl_benchmark, More Is Better)
Benchmark               Unit     Result    SE +/-  N
S3D                     GFLOPS   367.73    1.19    3
Triad                   GB/s     12.78     0.00    3
FFT SP                  GFLOPS   1539.83   0.63    3
MD5 Hash                GHash/s  45.74     0.15    3
Reduction               GB/s     541.91    0.39    3
GEMM SGEMM_N            GFLOPS   13385.5   8.77    3
Max SP Flops            GFLOPS   44825.1   25.06   3
Bus Speed Download      GB/s     12.92     0.00    3
Bus Speed Readback      GB/s     13.19     0.00    3
Texture Read Bandwidth  GB/s     2491.20   1.98    3
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
cl-mem 2017-01-13 (pts_opencl_benchmark, GB/s, More Is Better)
Benchmark  Result  SE +/-  N
Copy       345.7   0.13    3
Read       521.1   3.05    3
Write      514.8   0.15    3
1. (CC) gcc options: -O2 -flto -lOpenCL
FluidX3D 2.9 (pts_opencl_benchmark, MLUPs/s, More Is Better)
Test        Result  SE +/-  N
FP32-FP32   3357    0.33    3
FP32-FP16C  6880    1.53    3
FP32-FP16S  6780    0.67    3
clpeak 1.1.2 (pts_opencl_benchmark)
OpenCL Test                            Unit    Direction        Result    SE +/-  N
Kernel Latency                         us      Fewer Is Better  3.66      0.02    3
Integer Compute                        GIOPS   More Is Better   19644.88  3.83    3
Integer 24-bit Compute                 GIOPS   More Is Better   19674.45  2.50    3
Global Memory Bandwidth                GBPS    More Is Better   518.27    0.30    3
Double-Precision Compute               GFLOPS  More Is Better   702.62    0.03    3
Single-Precision Compute               GFLOPS  More Is Better   38349.93  4.31    3
Transfer Bandwidth enqueueReadBuffer   GBPS    More Is Better   12.86     0.01    3
Transfer Bandwidth enqueueWriteBuffer  GBPS    More Is Better   12.00     0.01    3
1. (CXX) g++ options: -O3
Rodinia 3.1 (pts_opencl_benchmark, Seconds, Fewer Is Better)
Test                    Result  SE +/-  N
OpenCL Myocyte          21.45   0.17    3
OpenCL Leukocyte        2.670   0.023   8
OpenCL Particle Filter  3.296   0.011   3
1. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl
ViennaCL 1.7.1 (pts_opencl_benchmark, More Is Better; "-" means no SE or N was reported)
Test                    Unit      Result  SE +/-  N
CPU BLAS - sCOPY        GB/s      112     0.67    3
CPU BLAS - sAXPY        GB/s      135     0.33    3
CPU BLAS - sDOT         GB/s      139     11.17   3
CPU BLAS - dCOPY        GB/s      59.2    0.07    3
CPU BLAS - dAXPY        GB/s      73.4    0.03    3
CPU BLAS - dDOT         GB/s      83.5    0.20    3
CPU BLAS - dGEMV-N      GB/s      84.4    4.28    3
CPU BLAS - dGEMV-T      GB/s      92.5    1.89    3
CPU BLAS - dGEMM-NN     GFLOPs/s  100.7   2.16    3
CPU BLAS - dGEMM-NT     GFLOPs/s  102.8   1.91    3
CPU BLAS - dGEMM-TT     GFLOPs/s  108     0.88    3
CPU BLAS - dGEMM-TN     GFLOPs/s  112     1.50    2
OpenCL BLAS - sCOPY     GB/s      348     0.33    3
OpenCL BLAS - sAXPY     GB/s      441     0.00    3
OpenCL BLAS - sDOT      GB/s      380     0.00    3
OpenCL BLAS - dCOPY     GB/s      475     0.33    3
OpenCL BLAS - dAXPY     GB/s      522     0.00    3
OpenCL BLAS - dDOT      GB/s      523     -       -
OpenCL BLAS - dGEMV-N   GB/s      197     0.00    3
OpenCL BLAS - dGEMV-T   GB/s      389     0.33    3
OpenCL BLAS - dGEMM-NN  GFLOPs/s  627     0.33    3
OpenCL BLAS - dGEMM-NT  GFLOPs/s  644     0.00    3
OpenCL BLAS - dGEMM-TN  GFLOPs/s  670     0.33    3
OpenCL BLAS - dGEMM-TT  GFLOPs/s  687     0.33    3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
Darktable 3.8.1, Acceleration: OpenCL (pts_opencl_benchmark, Seconds, Fewer Is Better)
Test         Result  SE +/-  N
Boat         1.280   0.002   3
Masskrug     1.699   0.016   3
Server Rack  0.119   0.001   3
Server Room  0.687   0.006   3
SmallPT GPU 1.6pts1, OpenCL Device: GPU, Resolution: 1920 x 1200 (pts_opencl_benchmark, Samples/sec, More Is Better)
Scene     Result      SE +/-  N
Caustic   1711642497  25.12   3
Cornell   1711642633  24.54   3
Caustic3  1711642770  25.12   3
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
LuxMark 3.1 (pts_opencl_benchmark, Score, More Is Better)
OpenCL Device  Scene        Result  SE +/-  N
GPU            Hotel        17792   43.31   3
CPU+GPU        Hotel        17792   20.33   3
GPU            Microphone   55201   127.22  3
GPU            Luxball HDR  82081   34.35   3
CPU+GPU        Microphone   55213   151.98  3
CPU+GPU        Luxball HDR  82042   14.75   3
Lulesh OpenCL 2017-07-06 (pts_opencl_benchmark, z/s, More Is Better)
Result: 8852.80, SE +/- 67.92, N = 3
1. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm
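Each result above is reported as a mean together with "SE +/-" and the run count N. The following is a minimal sketch of how those two figures can be turned into a run-to-run spread, assuming that SE is the usual standard error of the mean (sample standard deviation divided by the square root of N). The function name run_spread is hypothetical and is not part of the Phoronix Test Suite; the input values are copied from the SHOC S3D and ViennaCL CPU BLAS sDOT results.

```python
import math


def run_spread(mean, se, n):
    """Return (sample standard deviation, relative standard error in percent).

    Assumes `se` is the standard error of the mean over `n` runs,
    i.e. se = std_dev / sqrt(n).
    """
    std_dev = se * math.sqrt(n)
    rel_se_pct = 100.0 * se / mean
    return std_dev, rel_se_pct


# (name, mean, SE, N) taken from the result listing in this file.
samples = [
    ("shoc: OpenCL - S3D", 367.73, 1.19, 3),
    ("viennacl: CPU BLAS - sDOT", 139, 11.17, 3),
]

for name, mean, se, n in samples:
    std_dev, rel_se_pct = run_spread(mean, se, n)
    print(f"{name}: std dev ~ {std_dev:.2f}, relative SE ~ {rel_se_pct:.2f}%")
```

A relative SE of a few tenths of a percent (as for S3D) suggests a stable measurement, while a value of several percent (as for CPU BLAS sDOT) suggests the runs varied noticeably and the mean should be read with more caution.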
Phoronix Test Suite v10.8.5