opencl_works_20221029

AMD Ryzen 9 6900HS testing with a ASUS GA402RJ v1.0 (GA402RJ.315 BIOS) and ASUS AMD Radeon RX 6650 XT 8GB on openSUSE 20221027 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2210296-NE-OPENCLWOR07
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
opencl_works_20221029
October 29 2022
  2 Hours, 7 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


opencl_works_20221029OpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 6900HS @ 3.30GHz (8 Cores / 16 Threads)ASUS GA402RJ v1.0 (GA402RJ.315 BIOS)AMD Device 14b58 GB + 32 GB DDR5-4800MT/s1024GB Micron_2450_MTFDKBA1T0TFKASUS AMD Radeon RX 6650 XT 8GB (2400/1000MHz)AMD Navi 21/23MEDIATEK Device 7922openSUSE 202210276.0.3-1-default (x86_64)KDE PlasmaX Server 1.21.1.4OpenCL 2.1 AMD-APP (3486.0)1.3.224GCC 12.2.1 20221020 [revision 0aaef83351473e8f4eb774f8f999bbe87a4866d7] + Clang 15.0.2 + LLVM 7.0.1btrfs2560x1600ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerOpenCLVulkanCompilerFile-SystemScreen ResolutionOpencl_works_20221029 BenchmarksSystem Logs- amdgpu.ppfeaturemask=0xffffffff - Transparent Huge Pages: always- DRI_PRIME=1- --build=x86_64-suse-linux --disable-libcc1 --disable-libssp --disable-libstdcxx-pch --disable-libvtv --disable-werror --enable-cet=auto --enable-checking=release --enable-gnu-indirect-function --enable-host-shared --enable-languages=c,c++,objc,fortran,obj-c++,ada,go,d,jit --enable-libphobos --enable-libstdcxx-allocator=new --enable-link-mutex --enable-linux-futex --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none,amdgcn-amdhsa, --enable-plugin --enable-ssp --enable-version-specific-runtime-libs --host=x86_64-suse-linux --mandir=/usr/share/man --with-arch-32=x86-64 --with-build-config=bootstrap-lto-lean --with-gcc-major-version-only --with-slibdir=/lib64 --with-tune=generic --without-cuda-driver --without-system-libunwind - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - Platform Profile: balanced - CPU Microcode: 0xa404102 - ACPI Profile: balanced - GLAMOR - vBIOS Version: 113-REMBRANDT-X37- Python 2.7.18 + Python 3.10.7- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

opencl_works_20221029clpeak: Transfer Bandwidth enqueueWriteBufferclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Global Memory Bandwidthclpeak: Double-Precision Doubleclpeak: Single-Precision Floatclpeak: Integer Compute INTclpeak: Kernel Latencydarktable: Server Room - OpenCLdarktable: Server Rack - OpenCLdarktable: Masskrug - OpenCLdarktable: Boat - OpenCLrodinia: OpenCL Leukocyterodinia: OpenCL Myocytecl-mem: Writecl-mem: Readcl-mem: Copyshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - MD5 Hashshoc: OpenCL - FFT SPshoc: OpenCL - Triadshoc: OpenCL - Max SP Flopsopencl_works_2022102949.167.65201.05533.478312.792102.7710.240.7820.4453.0992.5864.05645.731190.6204.5177.3562.98714.057114.323010.7147572.16612.389415925100OpenBenchmarking.org

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferopencl_works_202210291122334455SE +/- 0.61, N = 449.161. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferopencl_works_20221029246810SE +/- 0.08, N = 37.651. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory Bandwidthopencl_works_202210294080120160200SE +/- 0.03, N = 3201.051. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision Doubleopencl_works_20221029120240360480600SE +/- 0.14, N = 3533.471. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision Floatopencl_works_202210292K4K6K8K10KSE +/- 30.16, N = 38312.791. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTopencl_works_202210295001000150020002500SE +/- 1.46, N = 32102.771. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel Latencyopencl_works_202210293691215SE +/- 0.12, N = 310.241. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.0.1Test: Server Room - Acceleration: OpenCLopencl_works_202210290.1760.3520.5280.7040.88SE +/- 0.000, N = 30.782

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.0.1Test: Server Rack - Acceleration: OpenCLopencl_works_202210290.10010.20020.30030.40040.5005SE +/- 0.002, N = 30.445

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.0.1Test: Masskrug - Acceleration: OpenCLopencl_works_202210290.69731.39462.09192.78923.4865SE +/- 0.006, N = 33.099

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 4.0.1Test: Boat - Acceleration: OpenCLopencl_works_202210290.58191.16381.74572.32762.9095SE +/- 0.018, N = 32.586

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Leukocyteopencl_works_202210290.91261.82522.73783.65044.563SE +/- 0.022, N = 34.0561. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Myocyteopencl_works_202210291020304050SE +/- 0.11, N = 345.731. (CXX) g++ options: -O2 -lOpenCL

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Writeopencl_works_202210294080120160200SE +/- 0.17, N = 3190.61. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Readopencl_works_202210294080120160200SE +/- 0.12, N = 3204.51. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: Copyopencl_works_202210294080120160200SE +/- 0.00, N = 3177.31. (CC) gcc options: -O2 -flto -lOpenCL

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read Bandwidthopencl_works_20221029120240360480600SE +/- 6.67, N = 4562.991. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Readbackopencl_works_2022102948121620SE +/- 0.01, N = 314.061. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed Downloadopencl_works_2022102948121620SE +/- 0.00, N = 314.321. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 Hashopencl_works_202210293691215SE +/- 0.00, N = 310.711. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPopencl_works_20221029120240360480600SE +/- 0.41, N = 3572.171. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Triadopencl_works_202210293691215SE +/- 0.09, N = 312.391. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Scene: Luxball HDR

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

OpenCL Device: GPU - Scene: Microphone

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

OpenCL Device: GPU - Scene: Hotel

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

SmallPT GPU

SmallPT GPU is an OpenCL benchmark that's run with various PTS changes compared to upstream and multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT - Scene: Caustic3

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT - Scene: Cornell

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT - Scene: Caustic

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

JuliaGPU

JuliaGPU is an OpenCL benchmark with this version containing various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

OpenCL Device: GPU - Resolution: $VIDEO_WIDTH x $VIDEO_HEIGHT

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Blender

Blend File: Barbershop - Compute: OpenCL

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: Error: Not freed memory blocks: 4, total unfreed memory 0.000427 MB

Blend File: BMW27 - Compute: OpenCL

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: Error: Cannot read file 'blender-3.3.0-linux-x64/OPENCL': No such file or directory

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL Particle Filter

opencl_works_20221029: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. E: ERROR: clEnqueueNDRangeKernel(kernel_likelihood)=>-54 failed

Test: OpenCL Heartwall

opencl_works_20221029: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Test: OpenCL LavaMD

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: Error: Failed to compile source (from CL or HIP source to LLVM IR).

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Test: OpenCL MRI Gridding

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Test: OpenCL TPACF

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: Error: clGetPlatformIDs!

Test: OpenCL Histo

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Test: OpenCL LBM

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: Error: clGetPlatformIDs!

Test: OpenCL BFS

opencl_works_20221029: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP Flopsopencl_works_202210293M6M9M12M15MSE +/- 1219971.29, N = 6159251001. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt