ptsopenclbenchmark

Intel Core i9-13980HX testing with a MSI MS-17S1 (E17S1IMS.30D BIOS) and MSI Intel RPL-S 16GB on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2403285-NE-PTSOPENCL91
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
pts_opencl_benchmark
March 28
  2 Hours, 43 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


ptsopenclbenchmarkOpenBenchmarking.orgPhoronix Test SuiteIntel Core i9-13980HX @ 5.60GHz (24 Cores / 32 Threads)MSI MS-17S1 (E17S1IMS.30D BIOS)Intel Device 7a2732GB2048GB Micron_3400_MTFDKBA2T0TFHMSI Intel RPL-S 16GB (210/405MHz)Intel Device 7a50PA248Realtek RTL8125 2.5GbE + Intel Device 7a70Ubuntu 22.046.5.0-26-generic (x86_64)Cinnamon 5.2.7X Server 1.21.1.4NVIDIA 535.161.074.6 Mesa 23.2.1-1ubuntu3.1~22.04.2OpenCL 3.0 CUDA 12.2.1481.3.255GCC 11.4.0 + CUDA 12.2ext41920x1200ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionPtsopenclbenchmark PerformanceSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x11d - Thermald 2.4.9 - BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.2a.00.2a- GPU Compute Cores: 9728- Python 3.10.12- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

ptsopenclbenchmarkshoc: OpenCL - S3Dshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthcl-mem: Copycl-mem: Readcl-mem: Writefluidx3d: FP32-FP32fluidx3d: FP32-FP16Cfluidx3d: FP32-FP16Sclpeak: Kernel Latencyclpeak: Integer Computeclpeak: Integer 24-bit Computeclpeak: Global Memory Bandwidthclpeak: Double-Precision Computeclpeak: Single-Precision Computeclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferrodinia: OpenCL Myocyterodinia: OpenCL Leukocyterodinia: OpenCL Particle Filterviennacl: CPU BLAS - sCOPYviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-TNviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLsmallpt-gpu: GPU - 1920 x 1200 - Causticsmallpt-gpu: GPU - 1920 x 1200 - Cornellsmallpt-gpu: GPU - 1920 x 1200 - Caustic3luxmark: GPU - Hotelluxmark: CPU+GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRluxmark: CPU+GPU - Microphoneluxmark: CPU+GPU - Luxball HDRlulesh-cl: pts_opencl_benchmark367.73412.78111539.8345.7399541.91013385.544825.112.921413.19192491.20345.7521.1514.83357688067803.6619644.8819674.45518.27702.6238349.9312.8612.0021.4462.6703.29611213513959.273.483.584.492.5100.7102.81081123484413804755225231973896276446706871.2801.6990.1190.6871711642497171164263317116427701779217792552018208155213820428852.8010OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3Dpts_opencl_benchmark80160240320400SE +/- 1.19, N = 3367.731. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triadpts_opencl_benchmark3691215SE +/- 0.00, N = 312.781. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPpts_opencl_benchmark30060090012001500SE +/- 0.63, N = 31539.831. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 Hashpts_opencl_benchmark1020304050SE +/- 0.15, N = 345.741. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Reductionpts_opencl_benchmark120240360480600SE +/- 0.39, N = 3541.911. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_Npts_opencl_benchmark3K6K9K12K15KSE +/- 8.77, N = 313385.51. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP Flopspts_opencl_benchmark10K20K30K40K50KSE +/- 25.06, N = 344825.11. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Downloadpts_opencl_benchmark3691215SE +/- 0.00, N = 312.921. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Readbackpts_opencl_benchmark3691215SE +/- 0.00, N = 313.191. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read Bandwidthpts_opencl_benchmark5001000150020002500SE +/- 1.98, N = 32491.201. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Copypts_opencl_benchmark80160240320400SE +/- 0.13, N = 3345.71. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Readpts_opencl_benchmark110220330440550SE +/- 3.05, N = 3521.11. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Writepts_opencl_benchmark110220330440550SE +/- 0.15, N = 3514.81. (CC) gcc options: -O2 -flto -lOpenCL

FluidX3D

FluidX3D is a speedy and memory efficient Boltzmann CFD (Computational Fluid Dynamics) software package implemented using OpenCL and intended for GPU acceleration. FluidX3D is developed by Moritz Lehmann and written free for non-commercial use. This is a test profile measuring the system OpenCL performance using the FluidX3D benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP32pts_opencl_benchmark7001400210028003500SE +/- 0.33, N = 33357

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16Cpts_opencl_benchmark15003000450060007500SE +/- 1.53, N = 36880

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16Spts_opencl_benchmark15003000450060007500SE +/- 0.67, N = 36780

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus, Fewer Is Betterclpeak 1.1.2OpenCL Test: Kernel Latencypts_opencl_benchmark0.82351.6472.47053.2944.1175SE +/- 0.02, N = 33.661. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer Computepts_opencl_benchmark4K8K12K16K20KSE +/- 3.83, N = 319644.881. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit Computepts_opencl_benchmark4K8K12K16K20KSE +/- 2.50, N = 319674.451. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory Bandwidthpts_opencl_benchmark110220330440550SE +/- 0.30, N = 3518.271. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision Computepts_opencl_benchmark150300450600750SE +/- 0.03, N = 3702.621. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision Computepts_opencl_benchmark8K16K24K32K40KSE +/- 4.31, N = 338349.931. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBufferpts_opencl_benchmark3691215SE +/- 0.01, N = 312.861. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBufferpts_opencl_benchmark3691215SE +/- 0.01, N = 312.001. (CXX) g++ options: -O3

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL BFS

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: /usr/bin/env: ‘python’: No such file or directory

Test: OpenCL LBM

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: /usr/bin/env: ‘python’: No such file or directory

Test: OpenCL Histo

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: /usr/bin/env: ‘python’: No such file or directory

Test: OpenCL TPACF

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: /usr/bin/env: ‘python’: No such file or directory

Test: OpenCL MRI Gridding

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: /usr/bin/env: ‘python’: No such file or directory

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL LavaMD

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: :129:14: error: variables in the local address space can only be declared in the outermost scope of a kernel function

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocytepts_opencl_benchmark510152025SE +/- 0.17, N = 321.451. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

Test: OpenCL Heartwall

pts_opencl_benchmark: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Leukocytepts_opencl_benchmark0.60081.20161.80242.40323.004SE +/- 0.023, N = 82.6701. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle Filterpts_opencl_benchmark0.74161.48322.22482.96643.708SE +/- 0.011, N = 33.2961. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYpts_opencl_benchmark306090120150SE +/- 0.67, N = 31121. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYpts_opencl_benchmark306090120150SE +/- 0.33, N = 31351. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTpts_opencl_benchmark306090120150SE +/- 11.17, N = 31391. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYpts_opencl_benchmark1326395265SE +/- 0.07, N = 359.21. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYpts_opencl_benchmark1632486480SE +/- 0.03, N = 373.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTpts_opencl_benchmark20406080100SE +/- 0.20, N = 383.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-Npts_opencl_benchmark20406080100SE +/- 4.28, N = 384.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-Tpts_opencl_benchmark20406080100SE +/- 1.89, N = 392.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNpts_opencl_benchmark20406080100SE +/- 2.16, N = 3100.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTpts_opencl_benchmark20406080100SE +/- 1.91, N = 3102.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTpts_opencl_benchmark20406080100SE +/- 0.88, N = 31081. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNpts_opencl_benchmark306090120150SE +/- 1.50, N = 21121. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYpts_opencl_benchmark80160240320400SE +/- 0.33, N = 33481. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYpts_opencl_benchmark100200300400500SE +/- 0.00, N = 34411. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTpts_opencl_benchmark80160240320400SE +/- 0.00, N = 33801. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYpts_opencl_benchmark100200300400500SE +/- 0.33, N = 34751. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYpts_opencl_benchmark110220330440550SE +/- 0.00, N = 35221. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTpts_opencl_benchmark1102203304405505231. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-Npts_opencl_benchmark4080120160200SE +/- 0.00, N = 31971. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-Tpts_opencl_benchmark80160240320400SE +/- 0.33, N = 33891. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNpts_opencl_benchmark140280420560700SE +/- 0.33, N = 36271. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTpts_opencl_benchmark140280420560700SE +/- 0.00, N = 36441. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNpts_opencl_benchmark140280420560700SE +/- 0.33, N = 36701. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTpts_opencl_benchmark150300450600750SE +/- 0.33, N = 36871. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Boat - Acceleration: OpenCLpts_opencl_benchmark0.2880.5760.8641.1521.44SE +/- 0.002, N = 31.280

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Masskrug - Acceleration: OpenCLpts_opencl_benchmark0.38230.76461.14691.52921.9115SE +/- 0.016, N = 31.699

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Server Rack - Acceleration: OpenCLpts_opencl_benchmark0.02680.05360.08040.10720.134SE +/- 0.001, N = 30.119

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.8.1Test: Server Room - Acceleration: OpenCLpts_opencl_benchmark0.15460.30920.46380.61840.773SE +/- 0.006, N = 30.687

Xsbench OpenCL

Xsbench benchmark in OpenCL via GPUOpen. Learn more via the OpenBenchmarking.org test page.

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

JuliaGPU

JuliaGPU is an OpenCL benchmark with this version containing various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

SmallPT GPU

SmallPT GPU is an OpenCL benchmark that's run with various PTS changes compared to upstream and multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Causticpts_opencl_benchmark400M800M1200M1600M2000MSE +/- 25.12, N = 317116424971. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Cornellpts_opencl_benchmark400M800M1200M1600M2000MSE +/- 24.54, N = 317116426331. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Resolution: 1920 x 1200 - Scene: Caustic3pts_opencl_benchmark400M800M1200M1600M2000MSE +/- 25.12, N = 317116427701. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: CPU - Scene: Hotel

pts_opencl_benchmark: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Error: RUNTIME ERROR: No OpenCL device selected or available

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Hotelpts_opencl_benchmark4K8K12K16K20KSE +/- 43.31, N = 317792

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: Hotelpts_opencl_benchmark4K8K12K16K20KSE +/- 20.33, N = 317792

OpenCL Device: CPU - Scene: Microphone

pts_opencl_benchmark: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Error: RUNTIME ERROR: No OpenCL device selected or available

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Microphonepts_opencl_benchmark12K24K36K48K60KSE +/- 127.22, N = 355201

OpenCL Device: CPU - Scene: Luxball HDR

pts_opencl_benchmark: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: Error: RUNTIME ERROR: No OpenCL device selected or available

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRpts_opencl_benchmark20K40K60K80K100KSE +/- 34.35, N = 382081

OpenCL Device: Hybrid GPU - Scene: Hotel

pts_opencl_benchmark: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: Microphonepts_opencl_benchmark12K24K36K48K60KSE +/- 151.98, N = 355213

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: CPU+GPU - Scene: Luxball HDRpts_opencl_benchmark20K40K60K80K100KSE +/- 14.75, N = 382042

OpenCL Device: Hybrid GPU - Scene: Microphone

pts_opencl_benchmark: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenCL Device: Hybrid GPU - Scene: Luxball HDR

pts_opencl_benchmark: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenDwarfs

OpenDwarfs is a non-commercial OpenCL compute benchmark suite developed at Virginia Tech in cooperation with various organizations. Learn more via the OpenBenchmarking.org test page.

Test: LU Decomposition

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Test: Compressed Sparse Row

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Test: Cyclic Redundancy Check

pts_opencl_benchmark: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Lulesh OpenCL

Lulesh OpenCL benchmark: Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLulesh OpenCL 2017-07-06pts_opencl_benchmark2K4K6K8K10KSE +/- 67.92, N = 38852.801. (CXX) g++ options: -std=c++11 -lOpenCL -O3 -lm

65 Results Shown

SHOC Scalable HeterOgeneous Computing:
  OpenCL - S3D
  OpenCL - Triad
  OpenCL - FFT SP
  OpenCL - MD5 Hash
  OpenCL - Reduction
  OpenCL - GEMM SGEMM_N
  OpenCL - Max SP Flops
  OpenCL - Bus Speed Download
  OpenCL - Bus Speed Readback
  OpenCL - Texture Read Bandwidth
cl-mem:
  Copy
  Read
  Write
FluidX3D:
  FP32-FP32
  FP32-FP16C
  FP32-FP16S
clpeak:
  Kernel Latency
  Integer Compute
  Integer 24-bit Compute
  Global Memory Bandwidth
  Double-Precision Compute
  Single-Precision Compute
  Transfer Bandwidth enqueueReadBuffer
  Transfer Bandwidth enqueueWriteBuffer
Rodinia:
  OpenCL Myocyte
  OpenCL Leukocyte
  OpenCL Particle Filter
ViennaCL:
  CPU BLAS - sCOPY
  CPU BLAS - sAXPY
  CPU BLAS - sDOT
  CPU BLAS - dCOPY
  CPU BLAS - dAXPY
  CPU BLAS - dDOT
  CPU BLAS - dGEMV-N
  CPU BLAS - dGEMV-T
  CPU BLAS - dGEMM-NN
  CPU BLAS - dGEMM-NT
  CPU BLAS - dGEMM-TT
  CPU BLAS - dGEMM-TN
  OpenCL BLAS - sCOPY
  OpenCL BLAS - sAXPY
  OpenCL BLAS - sDOT
  OpenCL BLAS - dCOPY
  OpenCL BLAS - dAXPY
  OpenCL BLAS - dDOT
  OpenCL BLAS - dGEMV-N
  OpenCL BLAS - dGEMV-T
  OpenCL BLAS - dGEMM-NN
  OpenCL BLAS - dGEMM-NT
  OpenCL BLAS - dGEMM-TN
  OpenCL BLAS - dGEMM-TT
Darktable:
  Boat - OpenCL
  Masskrug - OpenCL
  Server Rack - OpenCL
  Server Room - OpenCL
SmallPT GPU:
  GPU - 1920 x 1200 - Caustic
  GPU - 1920 x 1200 - Cornell
  GPU - 1920 x 1200 - Caustic3
LuxMark:
  GPU - Hotel
  CPU+GPU - Hotel
  GPU - Microphone
  GPU - Luxball HDR
  CPU+GPU - Microphone
  CPU+GPU - Luxball HDR
Lulesh OpenCL